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REGULATORS OF BIOFILM FORMATION 
5 AND USES THEREOF 

Background of the Invention 
This application claims benefit of U.S. provisional applications 60/303,286 and 
60/373,233, filed July 6, 2001 and April 16, 2002, respectively. The disclosures of 
1 0 which are hereby incorporated by reference. 

This invention relates to nucleic acid and amino acid sequences of genes 
regulating bacterial biofilm formation and to the use of these sequences as targets in the 
diagnosis, treatment, and prevention of bacterial infection and pathogenesis. In addition, 
the invention relates to screening methods for identifying modulators of bacterial biofilm 
1 5 formation and the development of antibacterial treatments. 

Bacteria possess the ability to form aggregated, organized, colonial communities 
called biofilms. Distinct from their free-living planktonic counterparts, bacterial cells 
that form biofilms secrete an exopolysacharide slime that surrounds and protects the 
bacterial colony. By adhering to each other and to surfaces or interfaces, these matrix- 
20 enclosed bacterial populations can cause any number of problems. By attaching to a 
variety of surfaces such as contact lenses, water pipes, hip replacements and food 
packaging, they can cause irritation, disease, immune rejection, and food poisoning. 

In addition to attaching to abiotic surfaces, many biofilm-forming bacteria 
colonize living tissue where they cause serious infection. For example, Pseudomonas 
25 aemginosa colonizes the lungs of cystic fibrosis (CF) patients as a biofilm. Chronic 
colonization of the airways by this bacterial pathogen leads to debilitating exacerbation 
of pulmonary infection and constitutes the major cause of morbidity and mortality in CF 
populations. Colonization of the CF lung by P: aeruginosa generally persists despite the 
use of long-term antibiotic therapy, since antibiotic treatment rarely results in complete 
30 eradication of the infection. 
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As current antibiotic therapies offer limited effectiveness in treating biofilm 
infection, a need exists for developing therapeutic agents that prevent biofilm formation. 
The discovery of polypeptides that regulate biofilm formation and polynucleotides 
encoding such polypeptides fulfills a need in the art by providing new compositions that 
5 are useful in the diagnosis, treatment, and prevention of bacterial infection and 
pathogenesis, as well as biofilm formation in both industrial and medical settings. 

Summary of the Invention 
As is described in more detail below, we have discovered a regulatory system 

10 that modulates microbial phenotypic switching. In one aspect, the invention features an 
isolated polypeptide that includes an amino acid sequence that is at least 50% (and 
preferably 60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95-99%) identical to the amino 
acid sequence of PvrR (SEQ ID NO:2), wherein expression of the polypeptide, in a 
microorganism, affects phenotype-mediated antibiotic-resistance in the microorganism. 

15 In preferred embodiments, the polypeptide includes the amino acid sequence of PvrR 
(SEQ ID NO:2) or consists essentially of the amino acid sequence of PvrR (SEQ ID 
NO:2) or a fragment thereof. 

In a related aspect, the invention features an isolated polypeptide fragment of an 
isolated polypeptide that includes an amino acid sequence having at least 50% identity to 

20 the amino acid sequence of PvrR (SEQ ID NO:2). In preferred embodiments, such a 
polypeptide fragment includes at least 300 contiguous amino acid residues of the amino 
acid sequence of PvrR (SEQ ID NO;2). In other embodiments, the fragment is at least 
250 amino acid residues, 200 amino acid residues, or 100 amino acid residues of the 
amino acid sequence of PvrR (SEQ ID NO:2). 

25 In another aspect, the invention features an isolated polynucleotide having at 

least 50% identity to the nucleotide sequence of pvrR (SEQ ID NO:l), wherein 
expression of the polynucleotide, in a microorganism, affects phenotype-mediated 
antibiotic-resistance in the microorganism. In preferred embodiments, the isolated 
polynucleotide includes the nucleotide sequence of pvrR (SEQ ID NO:l) or a 
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complement thereof. In yet other preferred embodiments, the polynucleotide consists 
essentially of the nucleotide sequence of pvrR (SEQ ID NO:l) or a fragment thereof. 

In still other related aspects, the invention features a vector including any of the 
aforementioned isolated polynucleotides and a host cell that includes the vector. 
5 The invention further features a variety of screening assays for identifying 

compounds that modulate phenotype-mediated antibiotic-resistance, biofilm formation, 
or biofilm-mediated antibiotic resistance. For example, the invention features a 
screening method that is useful for identifying a compound that modulates the gene 
expression of a regulator polynucleotide that affects phenotype-mediated antibiotic- 

10 resistance in a microorganism. Such a method includes the steps of: (a) providing a 
microbial cell (e.g., Pseudomonas, Vibrio, Salmonella, or Staphylococcus) that includes 
a polynucleotide having at least 50% identity to the nucleotide sequence of pvrR (SEQ 
ID NO:l)(or a nucleotide sequence that is substantially identical to pvrR), wherein 
expression of the polynucleotide, in the microbial cell, affects phenotype-mediated 

15 antibiotic-resistance in the microbial cell; (b) contacting the microbial cell with a 

compound; and (c) comparing the level of gene expression of the polynucleotide in the 
presence of the compound with the level of gene expression in the absence of the 
compound; wherein a measurable difference in gene expression indicates that the 
compound modulates gene expression of a regulator polynucleotide that affects 

20 phenotype-mediated antibiotic-resistance in a microorganism. 

In preferred embodiments, the screening method identifies a compound that 
increases or decreases transcription of the regulator polynucleotide. In other 
embodiments, the screening method identifies a compound that increases or decreases 
translation of an mRNA transcribed from the regulator polynucleotide. 

25 In other preferred embodiments, the microbial cell is a phenotypic variant (e.g., a 

small colony variant) having increased biofilm formation. Preferably, the small colony 
variant is a small colony variant of Pseudomonas, Vibrio, Salmonella, or 
Staphylococcus. In still other embodiments, the small colony variant is a rough small 
colony variant, for example, a rough small colony variant of Pseudomonas, Vibrio, 
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Salmonella, or Staphylococcus. In a preferred embodiment, the rough small colony 
variant is Pseudomonas aeruginosa PA14 RSCV. 

In other preferred embodiments, the activity of the compound used in the 
screening assay is dependent upon the presence of the pvrR gene (SEQ ID NO:l) or a 

5 functional equivalent thereof. For example, the identified compound targets and 

interacts with the pvrR gene (SEQ ID NO:l) or a functional equivalent thereof. In still 
other preferred embodiments, the expression of the regulator polynucleotide mediates 
phenotypic switching of the microbial cell in the presence of a high concentration of an 
antibiotic. In other preferred embodiments of the screening method, the polypeptide is 

0 expressed using an isolated polynucleotide that expresses a polypeptide having an amino 
acid sequence having at least 50% identity to the amino acid sequence of PvrR (SEQ ID 
NO:2) or a fragment thereof. 

In another aspect, the invention features a screening method for identifying a 
compound that modulates an activity of a polypeptide that affects phenotype-mediated 

5 antibiotic-resistance in a microorganism. The method, in general, includes the steps of: 
(a) providing a microbial cell expressing a polypeptide having at least 50% identity to 
the amino acid sequence of PvrR (SEQ ID NO:2) (or a polypeptide that is substantially 
identical to PvrR), wherein expression of the polypeptide, in the microbial cell, affects 
phenotype-mediated antibiotic-resistance in the microbial cell; (b) contacting the 

0 microbial cell with a compound; and (c) comparing an activity of the polypeptide in the 
presence of the compound with the activity in the absence of the compound; wherein a 
measurable difference in the activity indicates that the compound modulates the activity 
of the polypeptide that affects phenotype-mediated antibiotic-resistance in a 
microorganism. In preferred embodiments, the screening method identifies a compound 

5 that increases or decreases the activity of the polypeptide. Comparison of the activity of 
the polypeptide includes a variety of standard biochemical analyses including 
immunological assays. 

In preferred embodiments, the microbial cell utilized in the screening assay is a 
phenotypic variant (e.g., Pseudomonas aeruginosa PA14 RSCV) having increased 

0 biofilm formation relative to wild-type. 
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In other preferred embodiments, the regulator polypeptide is an isolated 
polypeptide that includes an amino acid sequence having at least 50% identity to the 
amino acid sequence of PvrR (SEQ ID NO:2) (or a polypeptide that is substantially 
identical to PvrR). In particular, such a polypeptide has the ability to regulate 
5 phenotypic switching; to regulate biofilm-mediated antibiotic-resistance; to mediate 
phenotypic switching of the microbial cell in the presence of a high concentration of an 
antibiotic; or to affect susceptibility of the microbial cell to antibiotic treatment; or to 
regulate, or mediate, or affect, or any combination of the aforementioned activities 
thereof. In other preferred embodiments, the regulator polypeptide is an element of a 

1 0 two-component regulatory system. In yet other preferred embodiments, the polypeptide 
is expressed by an isolated polynucleotide having at least 50% identity to the nucleotide 
sequence of pvrR (SEQ ID NO: 1) or a fragment thereof. 

Typically, the activity of the compound identified in the screening assay is 
dependent upon the presence of the PvrR polypeptide (SEQ ID NO:2) or a functional 

15 equivalent thereof. In particular aspects of the screening assay, the compound targets 
the PvrR polypeptide (SEQ ID NO:2) or a functional equivalent thereof. 

In another aspect, the invention features a screening method for identifying a 
compound that modulates microbial biofilm formation. This method, in general, 
includes the steps of: (a) culturing a microbial cell (e.g., Pseudomonas, Vibrio, 

20 Salmonella, or Staphylococcus) that includes a polypeptide having at least 50% identity 
to the amino acid sequence of PvrR (SEQ ED NO:2) (or a polypeptide that is 
substantially identical to PvrR), wherein the microbial cell, upon culturing, forms a 
biofilm; (b) contacting the microbial cell with a compound; and (c) comparing microbial 
biofilm formation in the presence of the compound with microbial biofilm formation in 

25 the absence of the compound; wherein a measurable difference in the microbial biofilm 
formation indicates that the compound modulates biofilm formation. 

In preferred embodiments, the screening method identifies a compound that 
increases or decreases biofilm formation. Typically, such biofilm formation is measured 
by using any standard method, for example, by assaying microbial aggregation (e.g., by 

30 using a microscope); using a salt aggregation test; or by using an attachment assay. 
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In preferred embodiments, the microbial cell is a phenotypic variant having 
increased biofilm formation when compared to its wild-type such as a small colony 
variant of Pseudomonas, Vibrio, Salmonella, or Staphylococcus. In other preferred 
embodiments, the small colony variant is a rough small colony variant of Pseudomonas, 
5 Vibrio, or Salmonella. In a preferred embodiment, the rough small colony variant is 
Pseudomonas aeruginosa PA14 RSCV. 

In yet other preferred embodiments, the activity of the compound utilized in the 
screening assay is dependent upon the presence of PvrR polypeptide (SEQ ID NO: 2) or 
a functional equivalent thereof. For example, the identified compound targets and 
10 interacts with the PvrR polypeptide (SEQ ID NO:2) or a functional equivalent thereof, 
resulting in increasing or decreasing its functional activity. 

In still another embodiment, the expression of the polypeptide mediates 
phenotypic switching of the microbial cell in the presence of a high concentration of an 
antibiotic. 

1 5 In another embodiment, the polypeptide is an isolated polypeptide that includes 

an amino acid sequence having at least 50% identity to the amino acid sequence of PvrR 
(SEQ ID NO:2) 3 wherein expression of the polypeptide, in a microorganism, affects 
phenotype-mediated antibiotic-resistance in the microorganism. 

In still another aspect, the invention features a method of treating a microbial 

20 infection involving a microorganism that forms a biofilm in a mammal. The method, in 
general, includes administering to the mammal a therapeutically-effective amount of a 
compound that induces or represses expression or activity of a polypeptide that includes 
an amino acid sequence having at least 50% identity to the amino acid sequence of PvrR 
(SEQ ID NO:2) (or a polypeptide that is substantially identical to PvrR) or a fragment 

25 thereof, wherein expression of the polypeptide or the fragment thereof, in a 

microorganism, affects phenotype-mediated antibiotic-resistance in the microorganism. 

In preferred embodiments, the method further includes administering to the 
mammal a therapeutically-effective amount of an antibiotic. The treatment is 
particularly useful for treating patients having cystic fibrosis or a chronic microbial 
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infection or both. In other preferred embodiments, the microorganism treated using the 
method belongs to the genus Pseudomonas, Vibrio, Salmonella, or Staphylococcus, 

In yet another aspect, the invention features a method of cleaning, disinfecting, 
or decontaminating a surface at least partially covered by a microorganism that forms a 

5 biofilm, the method involving contacting the microorganism with a cleaning 

composition including a compound that induces or represses expression or activity of a 
polypeptide that includes an amino acid sequence having at least 50% identity to the 
amino acid sequence of PvrR (SEQ ID NO:2) (or a polypeptide that is substantially 
identical to PvrR) or fragment thereof, wherein expression of the polypeptide or the 

0 fragment thereof, in a microorganism, affects phenotype-mediated antibiotic-resistance 
in the microorganism. 

In yet another aspect, the invention features a screening method for identifying a 
compound that decreases pathogenicity of an antibiotic-resistant phenotypic variant. 
The method, in general, includes the steps of: (a) contacting an antibiotic-resistant 

5 phenotypic variant with a candidate compound; and (b) measuring reversion of the 

antibiotic-resistant phenotypic variant to a wild-type phenotype, an increase in reversion 
indicating that the compound decreases pathogenicity of the antibiotic-resistant 
phenotypic variant. In preferred embodiments, the antibiotic-resistant phenotypic 
variant is cultured in the absence of an antibiotic, has increased biofilm formation; is a 

0 rough small colony variant; is a hyperpiliated variant; has increased hydrophobicity; has 
an alteration in a surface component; or is a pathogen such as a Gram positive bacterium 
(e.g., Staphylococcus) or a Gram negative bacterium (e.g., Vibrio, Pseudomonas, or 
Salmonella). 

In another aspect, the invention features a screening method for identifying a 
5 compound that decreases pathogenicity of an antibiotic-resistant phenotypic variant. 
The method, in general, includes the steps of: (a) culturing an antibiotic-resistant 
phenotypic variant with a candidate compound in the presence of an antibiotic; and (b) 
comparing the number of antibiotic-resistant phenotypic variants in the presence of the 
compound to the number of antibiotic-resistant phenotypic variants in the absence of the 
0 compound, a decrease in the number of the antibiotic-resistant phenotypic variants in the 
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presence of the compound indicating that the compound decreases pathogenicity of the 
antibiotic-resistant phenotypic variant. 

In yet another aspect, the invention features a screening method for identifying a 
polynucleotide encoding a regulator polypeptide, the method including the steps of: (a) 
5 providing a mutagenized microbe; (b) culturing the mutagenized microbe in the 
presence of an antibiotic; and (c) comparing the mutagenized microbe with a control 
wild-type microbe, wherein a change in the number of phenotypic variants identifies the 
mutagenized microbe as having a mutation in a polynucleotide encoding a regulator 
polypeptide. In preferred embodiments, the phenotypic variant is a small colony variant. 

10 In another aspect, the invention features a screening method for identifying a 

polynucleotide encoding a regulator polypeptide that modulates an antibiotic-resistant 
phenotype of a microorganism. The method, in general, includes the steps of: (a) 
identifying an antibiotic-resistant phenotypic variant of a microorganism including a 
first phenotype; (b) mutagenizing the antibiotic-resistant phenotypic variant of the 

1 5 microorganism, thereby generating a mutated phenotypic variant of the microorganism; 
and (c) selecting the mutated phenotypic variant of step (b) having a second phenotype, 
other than the first phenotype of the antibiotic-resistant phenotypic variant, wherein the 
second phenotype identifies a mutation in the mutated phenotypic variant of step (b); 
and (d) using the mutation for identifying a polynucleotide encoding a regulator 

20 polypeptide that modulates an antibiotic-resistant phenotype of a microorganism. In 
preferred embodiments, the second phenotype includes a wild-type phenotype. 

In yet another aspect, the invention features a screening method for identifying a 
polynucleotide encoding a regulator polypeptide that modulates phenotype-mediated 
antibiotic-resistance of a microorganism. The method, in general, includes the steps of: 

25 (a) transforming an antibiotic-resistant phenotypic variant of a microorganism with a 
candidate polynucleotide encoding a regulator polypeptide; and (b) culturing the 
transformed antibiotic-resistant phenotypic variant of a microorganism under conditions 
suitable for expression of the regulator polypeptide; and (c) measuring reversion of the 
transformed antibiotic-resistant phenotypic variant of the microorganism to a wild-type 
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phenotype, an increase in reversion identifies the polynucleotide as encoding a regulator 
polypeptide that modulates phenotype-mediated antibiotic-resistance. 

In preferred embodiments, the polynucleotide encodes a regulator polypeptide 
that modulates a phenotypic switch from an antibiotic-resistant phenotype to an 
5 antibiotic-susceptible phenotype. In other preferred embodiments, the candidate 
polynucleotide has at least .50% identity to the nucleotide sequence of pvrR (SEQ ID 
NO: 1) (or a polynucleotide sequence that is substantially identical to pvrR). In other 
embodiments, the candidate polynucleotide sequence is substantially identical to any one 
of the polynucleotides shown in Figures 5B, 5C, 6A-6K, and 7A-7E . In other preferred 
10 embodiments, the candidate polynucleotide encodes a polypeptide that is an element of a 
two-component regulatory system. 

In another aspect, the invention features an isolated polypeptide including an 
amino acid sequence that is substantially identical to the amino acid sequence of any one 
the polypeptides shown in Figures 5E (SEQ ID NO: 4) and 6L-6V (SEQ ID NOS: 19- 
15 29), each of which are encoded by a polynucleotide of the ORF1 region. 

For example, with respect to the ORF1 region, the invention features an isolated 
polypeptide that includes an amino acid sequence that is at least 5.0% (and preferably 
60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95-99%) identical to the amino acid 
sequence of the polypeptide shown in Figure 5E (SEQ ID NO: 4) or to a polypeptide 
20 shown in Figures 6L-6V (SEQ ID NOS : 1 9-29), wherein expression of the polypeptide, 
in a microorganism, affects phenotype-mediated antibiotic-resistance in the 
, microorganism. Preferably, the polypeptide includes the amino acid sequence shown in 
Figure 5E or consists essentially of the amino acid sequence shown in Figure 5E or a 
fragment thereof 

25 In a related aspect, the invention features an isolated polypeptide fragment of an 

isolated polypeptide that includes an amino acid sequence having at least 50% identity to 
the amino acid sequence the polypeptide shown in Figure 5E or to a polypeptide shown 
in any one of Figures 6L-6V. In preferred embodiments, such a polypeptide fragment 
includes at least 400 contiguous amino acid residues of the amino acid sequence shown 

30 in any one of Figures 5E and 6L-6V. In other embodiments, the fragment is at least 300 
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amino acid residues, 200 amino acid residues, or 100 amino acid residues of the 
polypeptides shown in Figures 5E and 6L-6V. 

In another aspect, the invention features an isolated polynucleotide molecule 
including a sequence substantially identical to any one of the polynucleotides shown in 
5 Figures 5B (SEQ ID NO:3) and 6A-6K (SEQ ID NOS: 8-18), which are found in the 
ORF1 region. In preferred embodiments, the isolated polynucleotide molecule has at 
least 45%, 50%, 60%, 70%, 80%, 90%, or even 95-99% identity to any one of these 
isolated molecules. 

For example, with respect to the ORF1 region, the invention features an isolated 

10 polynucleotide having at least 50% identity to the nucleotide sequence shown in Figure 
5B or to any one of the nucleotide sequences shown in Figures 6A-6K, wherein 
expression of the polynucleotide, in a microorganism, affects phenotype-mediated 
antibiotic-resistance in the microorganism. In preferred embodiments, the isolated 
polynucleotide includes the nucleotide sequence shown in Figure 5B or a complement 

15 thereof. In yet other preferred embodiments, the polynucleotide consists essentially of 
the nucleotide sequence shown in Figure 5B or a fragment thereof. 

In still other related aspects, the invention features a vector including any of the 
aforementioned isolated polynucleotides and a host cell that includes the vector. 
The invention further features a variety of screening assays for identifying 

20 compounds that modulate phenotype-mediated antibiotic-resistance, biofilm formation, 
or biofilm-mediated antibiotic resistance. For example, the invention features a 
screening method that is useful for identifying a compound that modulates the gene 
expression of a regulator polynucleotide that affects phenotype-mediated antibiotic- 
resistance in a microorganism. Such a method includes the steps of: (a) providing a 

25 microbial cell (e.g., Pseudomonas, Vibrio, Salmonella, or Staphylococcus) that includes 
a polynucleotide that is substantially identical to any one of the nucleotide sequences 
shown in Figures 5B or 6A-6K (or a polynucleotide having at least 40% identity to any 
one of these sequences), wherein expression of the polynucleotide, in the microbial cell, 
affects phenotype-mediated antibiotic-resistance in the microbial cell; (b) contacting the 

30 microbial cell with a compound; and (c) comparing the level of gene expression of the 
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polynucleotide in the presence of the compound with the level of gene expression in the 
absence of the compound; wherein a measurable difference in gene expression indicates 
that the compound modulates gene expression of a regulator polynucleotide that affects 
phenotype-mediated antibiotic-resistance in a microorganism. 

5 In preferred embodiments, the screening method identifies a compound that 

increases or decreases transcription of the regulator polynucleotide. In other 
embodiments, the screening method identifies a compound that increases or decreases 
translation of anjmRNA transcribed from the regulator polynucleotide. 

In other preferred embodiments, the microbial cell is a phenotypic variant (e.g., a 

0 small colony variant) having increased biofilm formation. Preferably, the small colony 
variant is a small colony variant of Pseudomonas, Vibrio, Salmonella, or 
Staphylococcus. In still other embodiments, the small colony variant is a rough small 
colony variant, for example, a rough small colony variant of Pseudomonas, Vibrio, 
Salmonella, or Staphylococcus. In a preferred embodiment, the rough small colony 

5 variant is Pseudomonas aeruginosa PA1 4 RSCV. 

In other preferred embodiments, the activity of the compound used in the 
screening assay is dependent upon the presence of any one of the polynucleotides shown 
in Figures 5B or 6A-6K, or a functional equivalent thereof. For example, the identified 
compound targets any one of the polynucleotides shown in Figures 5B or 6A-6K or a 

0 functional equivalent thereof. In still other preferred embodiments, the expression of the 
regulator polynucleotide mediates phenotypic switching of the microbial cell in the 
presence of a high concentration of an antibiotic. In other preferred embodiments of the 
screening method, the polypeptide is expressed using an isolated polynucleotide that 
encodes a polypeptide that is substantially identical to any one of the polynucleotides 

5 shown Figures 5B and 6A-6K or a fragment thereof. 

In another aspect, the invention features a screening method for identifying a 
compound that modulates an activity of a polypeptide that affects phenotype-mediated 
antibiotic-resistance in a microorganism. The method, in general, includes the steps of: 
(a) providing a microbial cell expressing a polypeptide that is substantially identical to 

3 any one of the polypeptides shown in Figures 5E and 6L-6V (or a polypeptide having at 
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least 40% identity to any one of these sequences), wherein expression of the 
polypeptide, in the microbial cell, affects phenotype-mediated antibiotic-resistance in the 
microbial cell; (b) contacting the microbial cell with a compound; and (c) comparing an 
activity of the polypeptide in the presence of the compound with the activity in the 
5 absence of the compound; wherein a measurable difference in the activity indicates that 
the compound modulates the activity of the polypeptide that affects phenotype-mediated 
antibiotic-resistance in a microorganism. In preferred embodiments, the screening 
method identifies a compound that increases or decreases the activity of the polypeptide. 
Comparison of the activity of the polypeptide includes a variety of standard biochemical 

10 analyses including immunological assays. 

In preferred embodiments, the microbial cell utilized in the screening assay is a 
phenotypic variant (e.g., Pseudomonas aeruginosa PA 14 RSCV) having increased 
biofilm formation. 

In other preferred embodiments, the regulator polypeptide is an isolated 

15 polypeptide that includes an amino acid sequence that is substantially identical to any 
one of the polypeptides shown in Figures 5E and 6L-6V (or a polypeptide having at least 
40% identity to any one of these sequences). In particular, such a polypeptide has the 
ability to regulate phenotypic switching; to regulate biofilm-mediated antibiotic- 
resistance; to mediate phenotypic switching of the microbial cell in the presence of a 

20 high concentration of an antibiotic; or to affect susceptibility of the microbial cell to 
antibiotic treatment; or any combination thereof. In other preferred embodiments, the 
regulator polypeptide is an element of a two-component regulatory system. In yet other 
preferred embodiments, the polypeptide is expressed by an isolated polynucleotide that 
is substantially identical to any one of the nucleotide sequences shown in Figures 5B\and 

25 6A-6K (or a polynucleotide having at least 40% identity to any one of these sequences) 
or a fragment thereof, upon which the activity of the regulator polypeptide is increased 
or decreased. 

Typically, the activity of the compound identified in the screening assay is 
dependent upon the presence of any one of the polypeptides shown in Figures 5E and 
30 6L-6V or a functional equivalent thereof. In particular aspects of the screening assay, 
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the compound targets or interacts with any one of the polypeptides shown in Figures 5E 
and 6L-6V or a functional equivalent thereof. 

In another aspect, the invention features a screening method for identifying a 
compound that modulates microbial biofilm formation. This method, in general, 
5 includes the steps of: (a) culturing a microbial cell (e.g., Pseudomonas, Vibrio, 

Salmonella, or Staphylococcus) that includes a polypeptide that is substantially identical 
to any one of the polypeptides shown in Figures 5E and 6L-6V (or a polypeptide having 
at least 40% identity to any one of these sequences), wherein the microbial cell, upon 
culturing, forms a biofilm; (b) contacting the microbial cell with a compound; and (c) 
10 comparing microbial biofilm formation in the presence of the compound with microbial 
biofilm formation in the absence of the compound; wherein a measurable difference in 
the microbial biofilm formation indicates that the compound modulates biofilm 
formation. 

In preferred embodiments, the screening method identifies a compound that 
15 increases or decreases biofilm formation. Typically, such biofilm formation is measured 
by using any standard method, for example, by assaying microbial aggregation (e.g., by 
using a microscope); using a salt aggregation test; or by using an attachment assay. 

In preferred embodiments, the microbial cell is a phenotypic variant having 
increased biofilm formation when compared to its wild-type such as a small colony 
20 variant of Pseudomonas, Vibrio, Salmonella, or Staphylococcus. In other preferred 

embodiments, the small colony variant is a rough small colony variant of Pseudomonas, 
Vibrio, or Salmonella. 

In yet other preferred embodiments, the activity of the compound utilized in the 
screening assay is dependent upon the presence of the polypeptide or a functional 
25 equivalent thereof. For example, the identified compound targets or interacts with the 
polypeptide or a functional equivalent thereof, resulting in increasing or decreasing its 
functional activity. 

In still another embodiment, the expression of the polypeptide mediates 
phenotypic switching of the microbial cell in the presence of a high concentration of an 
30 antibiotic. 
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In another embodiment, the polypeptide is an isolated polypeptide that includes 
an amino acid sequence that is substantially identical to any one of the polypeptides 
shown in Figures 5E and 6L-6V (or a polypeptide having at least 40% identity to any 
one of these sequences), wherein expression of the polypeptide, in a microorganism, 
5 affects phenotype-mediated antibiotic-resistance in the microorganism. 

In still another aspect, the invention features a method of treating a microbial 
infection involving a microorganism that forms a biofilm in a mammal. The method, in 
general, includes administering to the mammal a therapeutically-effective amount of a 
compound that induces or represses expression or activity of a polypeptide that includes 
10 a polypeptide that is substantially identical to any one of the polypeptides shown in 
Figures 5E and 6L-6V or a fragment thereof (or a polypeptide having at least 40% 
identity to any one of these sequences), wherein expression of the polypeptide or the 
fragment thereof, in a microorganism, affects phenotype-mediated antibiotic-resistance 
in the microorganism. 

15 In another aspect, the invention features an isolated polypeptide including an 

amino acid sequence that is substantially identical to the amino acid sequence of any one 
of the polypeptides shown in Figures 5F and Figures 7F-7J, each of which are encoded 
by a polynucleotide of the ORF3 region. 

For example, with respect to the ORF3 region, the invention features an isolated 
20 polypeptide that includes an amino acid sequence that is at least 50% (and preferably 
60%, 65%, 70%, 75%, 80%, 85%, 90%, or 95-99%) identical to the amino acid 
sequence of any one of the polypeptides shown in Figures 5F (SEQ ID NO:6) and 7F-7J 
(SEQ ID NOS:35-39), wherein expression of the polypeptide, in a microorganism, 
affects phenotype-mediated antibiotic-resistance in the microorganism. Preferably, the 
25 polypeptide includes the amino acid sequence shown in Figure 7J (SEQ ID NO:39) or 
consists essentially of the amino acid sequence shown in Figures 5F (SEQ ID NO:6) and 
7F-7I (SEQ ID NOS:35-38) or a fragment thereof. 

In a related aspect, the invention features an isolated polypeptide fragment of an 
isolated polypeptide that includes an amino acid sequence having at least 50% identity to 
30 the amino acid sequence of the polypeptides shown in Figures 5F and 7F-7J. In 
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preferred embodiments, such a polypeptide fragment includes at least 300 contiguous 
amino acid residues of the amino acid sequence shown in any one of Figures 5F and 7F- 
7J. In other embodiments, the fragment is at least 200 amino acid residues, or 100 
amino acid residues of the polypeptides shown in Figures 5F and 7F-7J. 
5 In another aspect the invention features an isolated polynucleotide molecule 

including a sequence substantially identical to any one of the polynucleotides shown in 
Figures 5C (SEQ ID NO:5) and 7A-7E (SEQ ID NOS:30-34). In preferred 
embodiments, the isolated polynucleotide molecule has at least 45%, 50%, 60%, 70%, 
80%, 90%, or even 95% identity to any one of these molecules. 

10 For example with respect to the ORF3 region, the invention features an isolated 

polynucleotide having at least 50% identity to any one of the nucleotide sequences 
shown in Figures 5C and 7A-7E, wherein expression of the polynucleotide, in a 
microorganism, affects phenotype-mediated antibiotic-resistance in the microorganism. 
In preferred embodiments, the isolated polynucleotide includes the nucleotide sequence 

15 shown in Figure 5C or a complement thereof. In yet other preferred embodiments, the 
polynucleotide consists essentially of the nucleotide sequence shown in Figure 5C or a 
fragment thereof. 

In still other related aspects, the invention features a vector including any of the - 
aforementioned isolated polynucleotides and a host cell that includes the vector. 

20 The invention further features a variety of screening assays for identifying 

compounds that modulate phenotype-mediated antibiotic-resistance, biofilm formation; 
or biofilm-mediated antibiotic resistance. For example, the invention features a 
screening method that is useful for identifying a compound that modulates the gene 
expression of a regulator polynucleotide that affects phenotype-mediated antibiotic- 

25 resistance in a microorganism. Such a method includes the steps of: (a) providing a 
microbial cell (e.g., Pseudomonas, Vibrio, Salmonella^ or Staphylococcus) that includes 
a polynucleotide substantially identical to the nucleotide sequences shown in Figures 5C 
and 7A-7E (or a polynucleotide having at least 45% identity to any one of these 
sequences), wherein expression of the polynucleotide, in the microbial cell, affects 

30 phenotype-mediated antibiotic-resistance in the microbial cell; (b) contacting the 
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microbial cell with a compound; and (c) comparing the level of gene expression of the 
polynucleotide in the presence of the compound with the level of gene expression in the 
absence of the compound; wherein a measurable difference in gene expression indicates 
that the compound modulates gene expression of a regulator polynucleotide that affects 
5 phenotype-mediated antibiotic-resistance in a microorganism. 

In preferred embodiments, the screening method identifies a compound that 
increases or decreases transcription of the regulator polynucleotide. In other 
embodiments, the screening method identifies a compound that increases or decreases 
translation of an mRNA transcribed from the regulator polynucleotide. 

10 In other preferred embodiments, the microbial cell is a phenotypic variant (e.g., a 

small colony variant) having increased biofilm formation. Preferably, the small colony 
variant is. a small colony variant of Pseudomonas, Vibrio, Salmonella, or 
Staphylococcus. In still other embodiments, the small colony variant is a rough small 
colony variant, for example, a rough small colony variant of Pseudomonas, Vibrio, 

1 5 Salmonella, or Staphylococcus. In a preferred embodiment, the rough small colony 
variant is Pseudomonas aeruginosa PA 14 RSCV. 

In other preferred embodiments, the activity of the compound used in the 
screening assay is dependent upon the presence of any one of the polynucleotides shown 
in Figures 5C and 7A-7E or a functional equivalent thereof. For example, the identified 

20 compound targets or interacts with any one of the polynucleotides shown in Figures 5C 
and 7A-7E or a functional equivalent thereof. In still other preferred embodiments, the 
expression of the regulator polynucleotide mediates phenotypic switching of the 
microbial cell in the presence of a high concentration of an antibiotic. In other preferred 
embodiments of the screening method, the polypeptide is expressed from an isolated 

25 polynucleotide that expresses a polypeptide that includes an amino acid sequence having 
at least 50% identity to any one of the amino acid sequences shown in Figures 5F and 
7F-7J or a fragment thereof. 

In another aspect, the invention features a screening method for identifying a 
compound that modulates an activity of a polypeptide that affects phenotype-mediated 

30 antibiotic-resistance in a microorganism. The method, in general, includes the steps of: 
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(a) providing a microbial cell expressing a polypeptide that is substantially identical to 
any one of the polypeptides shown in Figures 5F and 7F-7J (or a polypeptide having at 
least 45% identity to any one of these sequences), wherein expression of the 
polypeptide, in the microbial cell, affects phenotype-mediated antibiotic-resistance in the 
5 microbial cell; (b) contacting the microbial cell with a compound; and (c) comparing an 
activity of the polypeptide in the presence of the compound with the activity in the 
absence of the compound; wherein a measurable difference in the activity indicates that 
the compound modulates the activity of the polypeptide that affects phenotype-mediated 
antibiotic-resistance in a microorganism. In preferred embodiments, the screening 

10 method identifies a compound that increases or decreases the activity of the polypeptide. 
Comparison of the activity of the polypeptide includes a variety of standard biochemical 
analyses including immunological assays. 

In preferred embodiments, the microbial cell utilized in the screening assay is a 
phenotypic variant (e.g., Pseudomonas aeruginosa PA14 RSCV) having increased 

1 5 biofilm formation. 

In other preferred embodiments, the regulator polypeptide is an isolated 
polypeptide that includes an amino acid sequence that is substantially identical to any 
one of the polypeptides shown in Figures 5F and 7F-7J (or a polypeptide having at least 
45% identity to any one of these sequences). In particular, such a polypeptide has the 

20 ability to regulate phenotypic switching; to regulate biofilm-mediated antibiotic- 
resistance; to mediate phenotypic switching of the microbial cell in the presence of a 
high concentration of an antibiotic; or to affect susceptibility of the microbial cell to 
antibiotic treatment; or any combination thereof. In other preferred embodiments, the 
regulator polypeptide is an element of a two-component regulatory system; In yet other 

25 preferred embodiments, the polypeptide is expressed by an isolated polynucleotide 

substantially identical to any one of the nucleotide sequences shown in Figures 5C and 
7A-7E (or by a polynucleotide having at least 45% identity to any one of these 
sequences) or a fragment thereof, upon which the activity of the regulator polypeptide is 
increased or decreased. 
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Typically, the activity of the compound identified in the screening assay is 
dependent upon the presence of any one of the polypeptides shown in Figures 5F and 
7F-7 J or a functional equivalent thereof. In particular aspects of the screening assay, the 
compound targets and interacts with the polypeptide of Figures 5F and 7F-7J or a 
5 functional equivalent thereof. 

In another aspect, the invention features a screening method for identifying a 
compound that modulates microbial biofilm formation. This method, in general, 
includes the steps of: (a) culturing a microbial cell (e.g., Pseudomonas, Vibrio, 
Salmonella, or Staphylococcus) that includes a polypeptide substantially identical to any 

10 one of the amino acid sequences shown in Figures 5F and 7F-7J (or a polypeptide 

having at least 45% identity to any one of these sequences), wherein the microbial cell, 
upon culturing, forms a biofilm; (b) contacting the microbial cell with a compound; and 
(c) comparing microbial biofilm formation in the presence of the compound with 
microbial biofilm formation in the absence of the compound; wherein a measurable 

15 difference in the microbial biofilm formation indicates that the compound modulates 
biofilm formation. 

In preferred embodiments, the screening method identifies a compound that 
increases or decreases biofilm formation. Typically, such biofilm formation is measured 
by using any standard method, for example, by assaying microbial aggregation (e.g., by 
20 using a microscope); using a salt aggregation test; or by using an attachment assay. 

In preferred embodiments, the microbial cell is a phenotypic variant having 
increased biofilm formation when compared to its wild-type such as a small colony 
variant of Pseudomonas, Vibrio, Salmonella, or Staphylococcus. In other preferred 
embodiments, the small colony variant is a rough small colony variant of Pseudomonas, 
25 Vibrio, or Salmonella. 

In yet other preferred embodiments, the activity of the compound utilized in the 
screening assay is dependent upon the presence of the polypeptide or a functional 
equivalent thereof. For example, the identified compound targets and interacts with the 
polypeptide or a functional equivalent thereof, resulting in increasing or decreasing its 
30 functional activity. 
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In still another embodiment, the expression of the polypeptide mediates 
phenotypic switching of the microbial cell in the presence of a high concentration of an 
antibiotic. 

In another embodiment, the polypeptide is an isolated polypeptide that includes 
5 an amino acid sequence that is substantially identical to any one of the amino acid 

. sequences shown in Figures 5F and 7F-7J (or a polypeptide having at least 45% identity 
to any one of these sequences), wherein expression of the polypeptide, in a 
microorganism, affects phenotype-mediated antibiotic-resistance in the microorganism. 
In still another aspect, the invention features a method of treating a microbial 
10 infection involving a microorganism that forms a biofilm in a mammal. The method, in 
general, includes administering to the mammal a therapeutically-effective amount of a 
compound that induces or represses expression or activity of a polypeptide that includes 
an amino acid sequence that is substantially identical to any one of the amino acid 
sequences shown in Figures 5F and 7F-7J or a fragment thereof (or a polypeptide having 
15 at least 45% identity to any one of these sequences), wherein expression of the 

polypeptide or the fragment thereof, in a microorganism, affects phenotype-mediated 
antibiotic-resistance in the microorganism. 

In preferred embodiments, the method further includes administering to the 
mammal a therapeutically-effective amount of an antibiotic. The treatment is 
20 particularly useful for treating patients having cystic fibrosis or a chronic infection or 
both. In other preferred embodiments, the microorganism treated using the method 
belongs to the genus Pseudomonas, Vibrio, Salmonella, or Staphylococcus. 

In yet another aspect, the invention features a method of cleaning, disinfecting, 
or decontaminating a surface at least partially covered by a microorganism that forms a 
25 biofilm, the method involving contacting the microorganism with a cleaning 

composition including a compound that induces or represses expression or activity of a 
polypeptide that includes an amino acid sequence having at least 50% identity to the 
amino acid sequence of Figures 5E, 5F, 6L-6V, and 7F-7J or fragment thereof (or a 
polypeptide that is substantially identical to any one of these polypeptides), wherein 
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expression of the polypeptide or the fragment thereof, in a microorganism, affects 
phenotype-mediated antibiotic-resistance in the microorganism. 

The invention also features methods for identifying compounds useful for 
treating a patient having a biofilm infection. The method includes the steps of 
5 contacting a biofilm in vitro with (i) an antibiotic and (ii) a candidate compound (e.g., a 
compound that modulates the expression, at the transcriptional, post-transcriptional, 
translational, or post-translational levels, of a polynucleotide having at least 50% 
identity to any of the polynucleotides described herein (or that is substantially identical 
to a polynucleotide described herein), and determining whether the biofilm grows more 

1 0 slowly than (a) biofilm cells contacted with an antibiotic but not contacted with the test 
compound, and (b) biofilm cells contacted with the candidate compound but not with the 
antibiotic. In another embodiment, the biofilm is contacted with two or more different 
antibiotics. Exemplary antibiotics useful in the method include amikacin, 
aminoglicosides (e.g., tobramycin), aztreonam, carbenicillin, cephalosporins (e.g., 

15 ceftazidime or cefipime), chloramphenicol, gentamicin, levofloxacin, meropenem, 

piperacillin, tazobactam, tetracycline, and quinolones (e.g., ciprofloxacin), A candidate 
compound that reduces biofilm formation in the presence of an antibiotic (or 
combination of different antibiotics), but does not decrease biofilm formation in the 
absence of the antibiotic (or combination of different antibiotics), is a compound that is 

20 useful in combination therapy for treating a patient having a biofilm infection. 

The invention further features a method for treating a patient having a biofilm 
infection, by administering to the patient an antibiofilm combination therapy that 
includes a compound identified as modulating expression, at the transcriptional, post- ~" 
transcriptional, translational, or post-translational levels, of a polynucleotide having at 

25 least 50% identity to any of the polynucleotides described herein (or that is substantially 
identical to a polynucleotide described herein) and one or more antibiotics, including, 
but not limited to, amikacin, aminoglicosides (e.g., tobramycin), aztreonam, 
carbenicillin, cephalosporines (e.g., ceftazidime or cefipime), chloramphenicol, 
gentamicin, levofloxacin, meropenem, piperacillin, tazobactam, tetracycline, and 
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quinolones (e.g., ciprofloxacin), simultaneously or within a period of time (e.g., 14 to 21 
days) sufficient to inhibit the growth of the biofilm. 

Preferably, the compound and antibiotic are administered within fifteen days of 
each other, more preferably within five or ten days of each other, and most preferably 
5 within twenty-four hours of each other or even simultaneously. Exemplary biofilms 
treated according to any of the methods described herein are those formed by bacteria, 
including but not limited to, Pseudomonas, Staphylococcus, Salmonella, Vibrio, 
Haemophilus, Mycobacterium, Helicobacter, Burkltolderia, or Streptococci, 

In a related aspect, the invention also features a method for treating a patient 

10 having a biofilm such as one formed from Pseudomonas (e.g., Pseudomonas 

aeruginosa). In this method, a patient is administered (a) a first compound (e.g., a 
compound that modulates the expression, at the transcriptional, post-transcriptional, 
translational, or post-translational; of a polynucleotide having at least 50% identity to a 
polynucleotide described herein (or that is substantially identical to a polynucleotide 

1 5 described herein)), and (b) one or more antibiotics (such as amikacin, aminoglicosides 
(e.g., tobramycin), aztreonam, carbenicillin, cephalosporines (e.g., ceftazidime or 
cefipime), chloramphenicol, gentamicin, levofloxacin, meropenem, piperacillin, 
tazobactam, tetracycline, and quinolones (e.g., ciprofloxacin). If desired, the therapy 
includes administration of two antibiotics according to standard methods known in the 

20 art. Such dual antibiotic combinations most preferably include high-dose tobramycin 
plus meropenem, meropenem plus ciprofloxacin, or tobramycin (4 Jig/ml), or cefipime. 
Other preferred combinations include piperacillin plus tazobactam, or piperacillin' plus 
ciprofloxacin. The antibiotic and compound combination therapy are preferably 
administered simultaneously or within a period of time sufficient to inhibit the growth of 

25 the biofilm. 

In any of the foregoing treatments, the compound and antibiotic included in the 
combination therapy are preferably administered to the patient as part of a 
pharmaceutical composition that also includes a pharmaceutically acceptable carrier. 
Preferred modes of administration include intramuscular, intravenous, inhalation, and 
30 oral administration, or a combination thereof. 
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The antibiofilm combinations of the invention can also be part of a 
pharmaceutical kit. Preferably, the first compound (e.g., a compound identified as 
modulating expression, at the transcriptional, post-transcriptional, translational, or post- 
translational levels, of a polynucleotide or polypeptide having at least 50% identity to 
5 any one of the polynucleotide or polypeptide sequences described herein (or that is 
substantially identical to any one of the polynucleotides or polypeptides described 
herein)) and the second compound, an antibiotic, are formulated together or separately 
and in individual dosage amounts. 

Combination therapy may be provided wherever antibiotic treatment is 

10 performed: at home, the doctor's office, a clinic, a hospital's outpatient department, or a 
hospital. Treatment generally begins at a hospital so that the doctor can observe the 
therapy's effects closely and make any adjustments that are needed. The duration of the 
combination therapy depends on the kind of biofilm being treated, the age and condition 
of the patient, the stage and type of the patient's biofilm infection, and how the patient's 

1 5 body responds to the treatment. Drug administration may be performed at different 
intervals (e.g., daily, weekly, or monthly) and the administration of each agent can be 
determined individually. Combination therapy may be given in on-and-off cycles that 
include rest periods so that the patient's body has a chance to build healthy new cells 
and regain its strength. 

20 By "isolated polynucleotide" is meant a nucleic acid (e.g., a DNA) that is free of 

the genes which, in the naturally-occurring genome of the organism from which the 
nucleic acid molecule of the invention is derived, flank the gene. The term therefore 
includes, for example, a recombinant DNA that is incorporated into a vector; into an 
. autonomously replicating plasmid or virus; or into the genomic DNA of a prokaryote or 

25 eukaryote; or that exists as a separate molecule (for example, a cDNA or a genomic or 
cDNA fragment produced by PCR or restriction endonuclease digestion) independent of 
other sequences. In addition, the term includes an RNA molecule which is transcribed 
from a DNA molecule, as well as a recombinant DNA which is part of a hybrid gene 
encoding additional polypeptide sequence. 
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By "polypeptide" is meant any chain of amino acids, regardless of length or post- 
translational modification (for example, glycosylation or phosphorylation). 

By an "isolated polypeptide" is meant a polypeptide of the invention that has 
been separated from components which naturally accompany it. Typically, the 
5 polypeptide is isolated when it is at least 60%, by weight, free from the proteins and 
naturally-occurring organic molecules with which it is naturally associated. Preferably, 
the preparation is at least 75%, more preferably at least 90%, and most preferably at 
least 99%, by weight, a polypeptide of the invention. An isolated polypeptide of the 
invention may be obtained, for example, by extraction from a natural source (for 

1 0 example, a pathogen); by expression of a recombinant nucleic acid encoding such a 
polypeptide; or by chemically synthesizing the protein. Purity can be measured by any 
appropriate method, for example, column chromatography, polyacrylamide gel 
electrophoresis, or by HPLC analysis. 

By "substantially identical" is meant a polypeptide or nucleic acid molecule 

15 (e.g., a polynucleotide) exhibiting at least 50% identity to a reference amino acid 

sequence (for example, any one of the amino acid sequences described herein) or nucleic 
acid sequence (for example, any one of the nucleic acid sequences described herein). 
Preferably, such a sequence is at least 60%, more preferably 80%, and most preferably 
90% or even 95% identical at the amino acid level or nucleic acid to the sequence used 

20 for comparison. 

Sequence identity is typically measured using sequence analysis software (for 
example, Sequence Analysis Software Package of the Genetics Computer Group, 
University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 
53705, BLAST, BESTFIT, GAP, or PILEUP/PRETTYBOX programs). Such software 

25 matches identical or similar sequences by assigning degrees of homology to various 

substitutions, deletions, and/or other modifications. Conservative substitutions typically 
include substitutions within the following groups: glycine, alanine; valine, isoleucine, 
leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine, 
arginine; and phenylalanine, tyrosine. In an exemplary approach to determining the 
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degree of identity, a BLAST program may be used, with a probability score between e" 3 
and e" 100 indicating a closely related sequence. 

By "transformed cell" is meant a cell into which (or into an ancestor of which) 
has been introduced, by means of recombinant DNA techniques, a polynucleotide 
5 molecule encoding (as used herein) a polypeptide of the invention. 

By "positioned for expression" is meant that the polynucleotide of the invention 
(e.g., a DNA molecule) is positioned adjacent to a DNA sequence which directs 
transcription and translation of the sequence (i.e., facilitates the production of, for 
example, a recombinant polypeptide of the invention, or an RNA molecule). 

10 By "purified antibody" is meant an antibody which is at least 60%, by weight, 

free from proteins and naturally-occurring organic molecules with which it is naturally 
associated. Preferably, the preparation is at least 75%, more preferably 90%, and most 
preferably at least 99%, by weight, antibody. A purified antibody of the invention may 
be obtained, for example, by affinity chromatography using a recombinantly-produced 

1 5 polypeptide of the invention and standard techniques. 

By "specifically binds" is meant a compound or antibody which recognizes and 
binds a polypeptide of the invention but which does not substantially recognize and bind 
other molecules in a sample, for example, a biological sample, which naturally includes 
a polypeptide of the invention. 

20 By "derived from" is meant isolated from or having the sequence of a naturally- 

occurring sequence (e.g., a cDNA, genomic DNA, synthetic, or combination thereof). 

By "inhibiting biofilm formation" is meant the ability of a candidate compound 
to decrease the development or progression of biofilm formation. Preferably, such 
inhibition decreases biofilm formation by at least 1% to 5%, more preferably by at least 

25 1 0%, 1 5%, 20%, or 25%, and most preferably by at least 30% to 50%, as compared to 
biofilm formation in the absence of the candidate compound in any appropriate 
pathogenicity assay (for example, those assays described herein). In one particular 
example, inhibition is measured by continuous culture conditions of a microbe exposed 
to a candidate compound or extract, a decrease in the level of biofilm formation relative 
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to the level of biofilm formation of the microbe not exposed to the compound indicating 
compound-mediated inhibition of biofilm formation. 

By "biofilm regulator polynucleotide" is meant a polynucleotide encoding a 
cellular component (e.g., PvrR) that modulates phenotypic switching, such as a 
5 phenotypic switch that occurs during biofilm formation, disintegration, or both. 

By "phenotypic switching" is meant the reversible alteration of one or more 
phenotypic characteristics. Such an alteration typically occurs, for example, when a 
wild-type microbe develops into an antibiotic-resistant phenotypic variant or when an 
antibiotic-resistant phenotypic variant develops into a wild-type microbe. 
10 By "immunological assay" is meant an assay that relies on an immunological 

reaction, for example, antibody binding to an antigen. Examples of immunological 
assays include ELISAs, Western blots, immunoprecipitations, and other assays known to 
the skilled artisan. 

By a "two-component regulatory system" is meant a regulatory system that 
15 includes at least two components such as a sensor that senses an environmental signal 
and a response regulator that modulates one or more effectors. 

By "aggregation" is meant a collection of two or more individual 
microorganisms into a mass or clump, such that the individuals form an aggregated 
microbial unit. Aggregation can be measured using assays provided herein. Examplary 
20 assays include visual inspection, measuring attachment to a surface, or by assaying for - 
biofilm formation using methods known to the skilled artisan. . 

By "pathogenicity" is meant the ability of a microorganism to cause disease. A 
microorganism that forms a biofilm, has increased antibiotic resistance, or displays 
phenotypic variation is more pathogenic than a wild-type microorganism in that it is less 
25 susceptibile to conventional antibiotic treatment. 

The invention provides a number of targets that are useful for the development of 
drugs that specifically block the biofilm formation of a microbe. In addition, the 
methods of the invention provide a facile means to identify compounds that are safe for 
use in eukaryotic host organisms (i.e., compounds which do not adversely affect the 
30 normal development and physiology of the organism), and efficacious against 
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pathogenic microbes (i.e., by suppressing the virulence of a pathogen). In addition, the 
methods of the invention provide a route for analyzing virtually any number of 
compounds for an anti-virulence effect with high- volume throughput, high sensitivity, 
and low complexity. The methods are also relatively inexpensive to perform and enable 
5 the analysis of small quantities of active substances found in either purified or crude 

extract form. 

i 

Other features and advantages of the invention will be apparent from the detailed 
description, and from the claims. 

10 Brief Description of the Drawings 

Figure 1 A shows the reversion of PAH rough small colony variants (RSCV) to 
the wild-type phenotype as observed at the edges of the colonies (arrow) after 2-3 days 
. incubation on antibiotic free LB agar at room temperature. 

Figure IB shows a confocal scanning laser microscopic analysis of bacterial 
15 aggregates (arrows) formed by wild-type PAH and PAH RSCV expressing green 
fluorescent protein (GFP) after overnight growth in liquid broth. Scale bar, 25 pm. 

Figure 1C shows the attachment of wild-type PAH and antibiotic resistant 
variants to polyvinylchloride plastic (PVC) after 6 hours of growth. 

Figure ID shows a confocal laser scanning microscope analysis of biofilm 
20 formed by wild-type PAH and PAH RSCV expressing GFP in flow-chambers under 
continuous culture conditions. Scale bar, 50 pm. 

Figure IE shows PAH and PAH RSCV biofilm resistance to tobramycin as 
determined by measuring viable biomass on 45 hour-old established biofilms before 
(filled bars) and after (open bars) 36-hour tobramycin (200 ng/ml) treatment. 
25 Figure 2A shows the effect of different environmental stimuli on the rate of 

appearance of antibiotic resistant variants. This was determined by growing the cultures 
of wild-type PAH under the specified conditions on media containing 200 p,g/ml 
kanamycin. 

Figure 2B shows the minimal inhibitory concentrations of kanamycin for strain 
30 PAH using the different conditions specified. 
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Figure 3A shows the reversion of PA14 RSCV present in sputum samples of a 
cystic fibrosis patient (designated "CF 5") as observed on the edges of the variant 
colonies (arrow) after prolonged incubation on antibiotic-free medium at room 
temperature. 

5 Figure 3B shows the increased attachment to PVC plastic of antibiotic resistant 

variants SCV 42 and SCV 43 obtained after plating CF isolates CF 42 and CF 43 on 
tobramycin (10 ng/ml). 

Figure 4A shows the attachment to PVC plastic of PA14, antibiotic resistant 
variants, and PA14 RSCV carrying pEd202 (PA14 RSCV /pED202) or pUCP19 (PA14 
10 RSCV /pUCP19) after 4 hours of growth was quantitated. 

Figure 4B shows the predicted amino acid sequence alignment of PvrR with the 
sequences that correspond to VieA from V. Cholerae and the P. aeruginosa PAOl 
putative response regulator PA3947 (PAOl RR). Numbers above the scale indicate 
number of amino acids. Lower panel contains domain family numbers according to 
15 ProDom nomenclature. 

Figure 4C shows that the pvrR gene is flanked by two open reading frame 
regions (ORFs), designated ORF1 and ORF3, with the same transcriptional orientation. 
Start codons within ORFs were assigned based on visual inspection for appropriately 
spaced ribosome-binding sequences . 
20 Figure 4D shows the number of variants resistant to kanamycin (200 |ag/ml). 

This was evaluated after plating overnight cultures of PA 14 and PAH overexpressing 
PvrR(PA14/pED202). 

Figure 4E shows the attachment to PVC plastic of PA14 and PA14 
overexpressing PvrR (PA14/pED202) after 12 hours of growth, quantitated as described 
25 herein. 

Figure 4F shows the number of antibiotic resistant variants for PA 14 and the 
pvrR mutant (ApvrR) as determined by plating overnight cultures on LB agar containing 
kanamycin (200 |ig/ml). 

Figure 5 A shows the nucleic acid sequence of pvrR (SEQ ID NO:l). 
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Figure 5B shows the nucleic acid sequence of an ORF1 polynucleotide (SEQ ID 
NO:3), This polynucleotide sequence begins at nucleotide 1504 and ends at nucleotide 
2919 of SEQ ID NO: 7 as shown in Figure 5G. 

Figure 5C shows the nucleic acid sequence of an ORF3 polynucleotide (SEQ ID 
5 . NO:5). This polynucleotide sequence begins at nucleotide 4385 and ends at nucleotide 
6379 of SEQ ID NO:7 as shown in Figure 5G. 

Figure 5D shows the deduced amino acid sequence of PvrR (SEQ ID NO:2). 

Figure 5E shows the deduced amino acid sequence of a polypeptide (SEQ ID 
NO:4) encoded by the polynucleotide shown in Figure 5B. 
10 Figure 5F shows the deduced amino acid sequence of a polypeptide (SEQ ID 

NO:6) encoded by the polynucleotide shown in Figure 5C. 

Figure 5G shows the nucleic acid sequence (SEQ ID NO:7) that includes the 
pvrR gene (SEQ ID NO:l), and the ORF1 (SEQ ID NOS:3 and 8-18) and ORF3 (SEQ 
ID NOS:5 and 30-34) regions. The start and stop codons for the identified open reading 
1 5 frames are highlighted. 

Figures 6A-6K show the nucleotide sequences of several open reading frames 
identified in the ORF1 region (SEQ ID NO:8 begins at nucleotide 124 and ends at 
nucleotide 2919; SEQ ID NO:9 begins at nucleotide 199 and ends at nucleotide 2919; 
SEQ ID NO:10 begins at nucleotide 217 and ends at nucleotide 2919; SEQ ID NO:l 1 
20 begins at nucleotide 256 and ends at nucleotide 2919; SEQ ID NO:12 begins at ■ 

nucleotide 295 and ends at nucleotide 2919; SEQ ID NO: 13 begins at nucleotide 307 
and ends at nucleotide 2919; SEQ ID NO:14 begins at nucleotide 51 1 and ends at 
nucleotide 2919; SEQ ID NO: 15 begins at nucleotide 760 and ends at nucleotide 2919; 
SEQ ID NO: 16 begins at nucleotide 790 and ends at nucleotide 2919; SEQ ID NO: 17 
25 begins at nucleotide 9 1 9 and ends at nucleotide 29 1 9; and SEQ ID NO 1 8 begins at 
nucleotide 1429 and ends at nucleotide 2919). 

Figures 6L-6 V show the deduced amino acid sequences of the polypeptides 
(SEQ ID NOS: 19-29) identified in Figures 6A-6K above. 

Figures 7A-7E show the nucleotide sequence of several open reading frames 
30 identified in the ORF3 region (SEQ ID NO:30 begins at nucleotide 4388 and ends at 
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nucleotide 6379; SEQ ID NO:3 1 begins at nucleotide 4550 and ends at nucleotide 6379; 
SEQ ID NO:32 begins at nucleotide 4572 and ends at nucleotide 6379; SEQ ID NO:33 
begins at nucleotide 4880 and ends at nucleotide 6379;and SEQ ID NO:34 begins at 
nucleotide 5258 and ends at nucleotide 6379). 
5 Figures 7F-7 J show the deduced amino acid sequences of the polypeptides (SEQ 

ID NOS:35-39) identified in Figures 7A-7E above. 

Detailed Description 

Overview 

10 Pseudomonas aeruginosa is the most important pathogen in the lungs of cystic 

fibrosis (CF) patients. Colonization of the CF lung by P. aeruginosa persists despite the 
use of long-term antibiotic therapy, since antibiotic treatment rarely results in 
eradication of the infection. Reports have suggested a direct link between resistance to 
f antimicrobial compounds and the ability of P. aeruginosa to form biofilm in CF lungs. 

1 5 Other hypotheses explain P. aemginosa antibiotic resistance by postulating that factors 
within the CF respiratory tract select for phenotypic variants suited to survive 
antimicrobial treatment. As is discussed below, we have determined that a clinical 
isolate of P. aeruginosa, strain PA 14, was capable of growing under inhibitory 
concentrations of the antibiotic kanamycin (up to 40 times the susceptibility level of the 

20 strain) when bacteria had undergone phenotypic variation. The antibiotic resistant 

variant colonies obtained from kanamycin plates were smaller in size and had a different 
colony morphology compared to the wild-type. Analysis of the phenotype of PA14 
RSCV indicated that these variants exhibited increased aggregation and attachment to 
glass tubes and polyvinylchloride plastic (P VC) as a result of enhanced surface 

25 hydrophobicity. Consistent with these observations, several PA14 RSCV clones were 
hyperpiliated when analysed by transmission electron microscopy. Moreover, 
examination of biofilms cultivated in flow chamber cells showed that PA14 RSCV 
formed more biofilm and faster than the wild-type strain. The biofilm formed by PAH 
RSCV also showed increased resistance to tobramycin relative to wild-type PA14 

30 biofilm. Similar results were obtained for several CF isolates using different antibiotics 
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(including tobramycin), suggesting that nonspecific antibiotic resistance acquired 
through phenptypic variation is a common mechanism in P aeruginosa. Moreover, 
analysis of sputum samples taken from CF patients revealed that antibiotic treatment 
selects for antibiotic resistant variants. The frequency with which antibiotic resistant 
5 variants appeared was also affected by environmental stimuli. Environmental stimuli 
such as salt concentration, temperature, and bacterial media altered the frequency of 
appearance of resistant variants. 

To identify components involved in the regulation of antibiotic resistance 
mediated by phenotypic variation, a library of PA 14 chromosomal DNA was transferred 

1 0 into PA14 RSCV and screened for colonies displaying wild-type colony size and 

morphology. This led to the identification of a clone, pED202, that restored the colony, 
the autoagglutination, and attachment phenotypes of PAH RSCV variants to wild-type. 
pED202 contained a single gene (designated pvrR for phenotype variant regulator) that 
showed sequence similarities to response regulator elements of the two-component 

1 5 regulatory system found in Vibrio cholerae response regulator VieA, and in P. 
aeinginosa strain PAOl (ORF PA3947). 

Consistent with the putative role of PvrR in the regulation of phenotypic 
switching, overexpression of PvrR from pED202 in wild-type PAH resulted in reduced 
attachment to PVC plastic. Moreover, examination of the frequency of resistant variants 

20 obtained from kanamycin plates showed a reduction in the number of colonies resistant 
to antibiotic obtained from the PvrR overexpressing strain. An in-frame deletion of 
pvrR (ApvrR) constructed in PAH increased frequency of appearance of resistant 
variants on kanamycin plates with respect to the wild-type, confirming the involvement 
of pvrR in the regulation of phenotypic switching. These results suggested that PvrR 

25 might be acting upstream of the switch, since inactivation of pvrR by mutation did not 
result in conversion to the variant type. 

Below we describe the cloning and characterization of PvrR, a regulator of 
biofilm-mediated antibiotic resistance and a target for compounds useful in antibacterial 
therapy, along with antibiotics, for the treatment of chronic infections and biofilm 

30 control in medical and industrial settings. In addition, we describe the identification of 
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open reading frame regions, designated ORF1 and ORF3, that flank the pvrR gene. The 
following examples are for the purposes of illustrating the invention, and should not be 
construed as limiting. 

5 Appearance of Rough Small Colony Variants with Increased Antibiotic Resistance 

When, cultured under high concentrations of antibiotic, Pseudomonas aeruginosa 
PAH was found to shift its development to a rough small colony phenotype, leading to 
the production of antibiotic resistant colonies. To induce such phenotypic variants, an 
overnight culture of P. aeiwginosa strain PA14 (UCBPP-PA14) was inoculated onto 

10 Luria-Bertani (LB) containing 200 \igjml of kanamycin, incubated at 37°C for 48 hours, 
at which time, antibiotic resistant rough small variants were isolated. Antibiotic 
resistant colonies arose at a frequency of 10" 6 -10" 7 . The colonies identified on these 
plates were one-tenth the size of wild type and exhibited a rough phenotype compared to 
the smooth colony type of wild-type PA 14, One class of kanamycin resistant variants 

1 5 (approximately 30%) exhibited a rough phenotype compared to the smooth colony type 
of wild-type PA14. When incubated for three to five days in LB media without 
antibiotic at room temperature, the rough phenotype reverted to the wild-type phenotype 
(Figure 1 A), indicating that the phenotypic changes were transient, and not due to 
mutation. In addition to being resistant to kanamycin, (up to 40 times the susceptibility 

20 level of the wild-type), 8 individual PAH RSCV colonies tested were also resistant to 
amikacin (30 \xg/ml) 9 carbenicillin (300 |ig/ml), gentamicin (30 fig/ml), tobramycin (10 
^ig/ml), and tetracycline (150 |xg/ml). Consistent with this latter result, antibiotic 
resistant variants were also obtained at frequencies of about 10" 7 by plating overnight 
cultures of PA14 on media containing similar concentrations of the antibiotics 

25 mentioned above. Although RSCV colonies were smaller than wild-type, their small 

colony size was not a consequence of slow growth since the generation time of RSCV in 
liquid medium was not significantly different from that of the wild-type, even in LB 
containing 200 fig/ml kanamycin. 
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Phenotvpic Changes Associated With Appearance of Resistance 

To establish a connection between the phenotypic switch from wild-type to small 
variant colony and the emergence of antibiotic resistance, comparative attachment, 
agglutination, and biofilm formation studies of wild-type PA14 and PAH RSCV were 
5 conducted. 

The results of these experiments showed that PA14 RSCV formed visible 
bacterial aggregates when overnight liquid cultures were left without shaking at room 
temperature (Figure IB). Moreover, abundant bacterial aggregates formed when liquid 
cultures were grown with gentle agitation, indicating that PA14 RSCV had increased 

10 cell-cell attachment compared to the wild-type phenotype. 

In addition to the autoagglutination phenotype, PA14 RSCV developed a visible 
biofilm on the walls of glass tubes after overnight incubation in liquid culture. Wild- 
type PA 14 failed to form a similar biofilm under these conditions. These results 
indicated that cell-surface interactions, as well as cell-cell interactions were increased in 

1 5 the variant. Consistent with this observation, PA14 RSCV were found to have increased 
attachment to PVC plastic (Figure 1C) in assays conducted in 96- well microtiter plates. 
When reversion was induced in PAH RSCV, the reverted bacteria showed wild-type 
levels of both agglutination and attachment to glass and PVC plastic. 

To quantitatively assess differences between the strains, standard bacterial 

20 attachment assays were performed in 96-well polyvinylchloride (PVC) plastic plates 
according to the methods described by O'Toole et al. (Mol Microbiol 30: 295, 1998). 
Overnight cultures of PAH and PAH RSCVwere diluted to an OD 6 oo of 0.1 in fresh 
minimal M63 salts supplemented with glucose (0.3%), MgS0 4 (1 mM), and casamino 
acids (CAA, 0.5%). Aliquots of 100 \il were next dispensed into the wells of PVC 

25 plastic microtiter plates and incubated for 6 hours at 37°C. The attachment of bacteria to 
the walls of the microtiter well was then detected by staining with 1% crystal violet 
dissolved in water. Dye not associated with bacteria was removed by thorough rinsing 
with water. Bacteria-associated dye was solubilized using 95% ethanol and absorbance 
was determined at OD 55 o. 
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In addition, since the ability of bacteria to attach to each other and to surfaces 
depends in part on the interaction of hydrophobic domains (Drumm et al., J. Clin. Invest 
84:1588, 1989), the hydrophobic surface properties of the wild-type and PA14 RSCV 
were determined using a standard salt aggregation test (Sherman et al., Infect. Immun. 
5 49:797, 1985). 5 x 10 8 bacteria per ml in 0.025 ml were mixed on a microscope slide 
with an equal volume of ammonium sulfate in 0.002 M sodium phosphate, pH 6.8. The 
ammonium sulfate concentrations varied from 0.0625 M to 4.0 M, and the presence of 
, salt-induced bacterial aggregation was monitored for 2 minutes at room temperature by 
phase-contrast microscopy. Agglutination in salt concentrations of less than 0.1 M is 

10 taken as an indication of the presence of a hydrophobic bacterial surface. Hydrophilic 
surfaces were demonstrated by the agglutination of bacteria only in high salt 
concentrations (2.0 to 4.0 M). 

The data obtained from the salt aggregation tests showed that PA14 RSCV were 
agglutinated at a lower salt concentration (0.125 M) compared to the wild-type PA14 

15 (0.5 M), suggesting that PA14 RSCV has a higher degree of surface hydrophobicity than 
the wild-type. Therefore, the data indicated that a change in the hydrophobic properties 
of the surface of the bacteria was partially responsible for the general increase in surface 
attachment of the PAH RSCV phenotypic variant. To further demonstrate the role of 
hydrophobicity in surface attachment, PAH RSCV were cultured in the presence of 

20 tetramethyl urea (TMU), a hydrophobic bond-breaking agent, at a concentration of 200 
mM. Addition of TMU to the culture media was found to reduce the attachment of the 
phenotypic variant PA 14 RSCV to wild-type levels, confirming the hydrophobic nature 
of the bacterial surface. TMU, at the concentration used in these assays, did not affect 
cell viability. 

25 Transmission electron microscopic analysis of several PAH RSCV clones 

revealed that they were hyperpiliated, which is consistent with the increased 
hydrophobicity and agglutination phenotypes. However, the various phenotypes of 
PAH RSCV were not simply a consequence of hyperpiliation since a hyperpiliated 
mutant of P. aeruginosa PAH, pilU, exhibited only marginally enhanced 

30 hydrophobicity and attachment to PVC plastic and did not exhibit enhanced resistance to 
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antibiotics (data not shown). These results are consistent with previous reports which 
indicated that phenotypic variation in Gram-negative bacteria involve changes in 
expression of a number of surface structures, outer membrane proteins, and 
lipopolysaccharides resulting in altered aggregation and colony morphology. Several 
5 PA14 RSCV clones were tested in the experiments described above and all exhibited 
similar phenotypes. A single PAH RSCV clone was therefore chosen for fiirther 
analysis. 

To determine whether the antibiotic resistant phenotype of PAH RSCV is 
associated with altered biofilm formation, PAH RSCVwas cultured under biofilm- 

10 forming conditions as follows. For biofilm characterization, PAH RSCV biofilms were 
cultivated under continuous culture conditions in flow-chambers with channel 
dimensions of 1 2 by 52 by 2 mm. Flow media consisted of M63 supplemented with 
0.5% casamino acids and 0.3% glucose. For measurement of biofilm resistance, bacteria 
were cultivated in flow-chambers with channel dimensions of 1 by 40 by 4 mm (Stovall . 

1 5 Inc., Greensboro, NC). In this case, flow media consisted of FAB medium (0.1 mM 
CaCl 2 , 0.01 mM Fe-EDTA, 0.15 mM NH 4 S0 4 , 0.33 mM Na 2 HP0 4 , 0.2 mM KH 2 P0 4 
and 1 mM MgCl 2 ) supplemented with casamino acids (0.5%) and sodium citrate (10 
mM), Flow-cells in both cases were inoculated with 100-fold dilutions of overnight 
cultures of PAH and PAH RSCV carrying the green fluorescent protein (GFP) in 

20 plasmid SMC21, a derivative of pSMC2 (Bloemberg et al., Appl Environ. Microbiol 
63: 4543-4551, 1997). After inoculation, the medium flow was stopped for 1 hour. 
Medium flow was then resumed at a rate of 0.2 ml/min using a peristaltic pump 
(IsmaTec, Zurich, Switzerland), and the flow-cell system was incubated at 37° C. 
Analysis of biofilm spatial structures was performed using confocal scanning laser 

25 microscopy (CSLM) using a Leica TCS SP system (Leica Lasertechnik, GmgH, 
Heidelberg, Germany). Image analysis of antibiotic-treated biofilms was done in 
structures contained within serial section stacks of images delimited by freehand 
drawing. Pixel intensities unique to GFP-labeled bacteria and surrounding biofilm were 
established by the threshold limit technique. The volume (in pm 3 ) of individual biofilm 
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structures was determined from serial sections using ImageSpace software (Molecular 
Dynamics, Sunnyvale, CA). 

The results from these studies showed that the PA14 RSCV phenotypic variant 
formed not only more biofilm than the wild-type strain, but also formed biofilm faster 
5 (RSCV microcolonies appeared 4-5 hours earlier than wild-type). Moreover, PAH 
RSCV and wild-type PAH displayed significantly different patterns of biofilm 
development. Wild-type PAH initially formed regularly-spaced, flat, circular, 
microcolonies that eventually developed into ball-shaped microcolonies. In contrast, 
PAH RSCV formed irregularly shaped three-dimensional structures that were densely 
10 packed with bacteria, without the typical microcolony morphology (Figure ID). Finally, 
the biofilm structures formed by PAH RSCV were larger in size than the wild-type 
microcolonies, and biofilms from PAH RSCV contained more biomass than the wild- 
type. 

To determine whether PAH and PAH RSCV biofilms exhibited antibiotic 

1 5 resistance that paralleled the resistance observed on plates containing antibiotic, 

established PAH and PAH RSCV biofilms grown in flow chambers were exposed to a 
continuous flow of tobramycin (200 ng/ml). Viable biomass was measured by CSLM 
analysis of GFP-tagged PAH and PAH RSCV cells using GFP expression as a viability 
marker as described previously (Figure IE). Consistent with the results obtained in 

20 plates, the biofilm formed by PAH RSCV was more resistant to tobramycin treatment 
than the wild-type PAH biofilm. 

Phenotypic variation is a common phenomenon in Gram-negative bacteria that 
often involves environmentally regulated changes in observable phenotypes produced by 
modifications in surface components. The effect that different environmental stimuli 

25 had on the appearance of kanamycin-resistant phenotypic variants was examined. 

Bacteria were grown in LB broth, or in supplemented LB with appropriate antibiotics at 
the indicated temperature with aeration. As shown in Figure 2 A, a 40-fold increase in 
the frequency of appearance of resistant variants (not just PAH RSCV) was observed on 
LB media supplemented with 85 mM NaCl as compared to the same medium without 

30 NaCl. Moreover, the frequency of variants increased 200-fold when plates were 
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incubated at 25°C compared to 37°C (Figure 2A). Finally, a dramatic 10 6 - fold increase 
was obtained on minimal M63 salts as compared to LB medium (Figure 2A). Minimal 
salt media consisted of M63 supplemented with 0.3% glucose, 1 mM MgS0 4 , and 0.5% 
casamino acids. Importantly, there was a correlation between the frequency of 
5 appearance of kanamycin resistant variants on plates and minimal inhibitory 

concentrations (MICs) of kanamycin in liquid culture for the wild-type PA14 using the 
culture conditions described above (Figure 2B). For example, the high frequency of 
resistant variants obtained on M63 correlated with the relatively high concentration of 
kanamycin (475 Mg/ml) required to inhibit the growth of PA14 in M63 liquid medium 
1 0 (Figures 2A and 2B). These data indicated that the components involved in the 

formation of antibiotic resistant variants are differentially regulated by environmental 
signals. Moreover, the data indicated that the portion of the population that becomes 
resistant to antibiotics through phenotypic variation was largely dependent on 
environmental conditions. 

15 

Small Colony Variants in CF Sputum Samples 

The presence of phenotypic variants with small colony phenotypes has been 
reported in cystic fibrosis (CF) patients (Haussler et al., Clin. Infect. Dis. 29:621, 1999). 
Emergence of this and other variant phenotypes in the CF lung has also been linked to 

20 prolonged antibiotic treatment (McNamara et al., Int. J. Antimicrob. Agents 14:117, 
2000; Kahl et al., J. Infect. Dis. 177:1023, 1998). To investigate whether antibiotic 
treatment in P. aeruginosa CF infections results in selection for resistant variants, we 
looked for the presence of small colony variants in CF sputum samples. 

Five CF sputum samples from the Clinical Microbiology Laboratory at 

25 Massachusetts General Hospital were suspended in 5 ml of 10 mM MgSC>4. Serial 
dilutions of the samples were then plated onto cetrimide agar plates with and without 
antibiotics. The plates were screened for the presence of P. aeruginosa after 24 and 48 
hours of incubation at 37°C. The identity of the colonies was later confirmed by probing 
colony lifts with the exotoxin A gene from P. aei-uginosa. To this end, the EcoRI- 

30 Hindlll fragment of plasmid pRGI containing the exoA gene (Samadpour et al., J. Clin. 
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Microbiol 26:2319-23, 1988) was gel isolated and labeled using a random priming kit 
, (Boehringer, Mannheim, Indianapolis, Ind.). Colonies were transferred to nylon 
membranes and hybridizations were performed according to the manufacturer's 
recommendations (NEN Research Products, Boston, MA). Identification of colonies 
5 carrying the exoA gene was then performed using a Phosphorimager (Amersham 
Pharmacia Biotech Inc., Piscataway, NJ ). 

Five sputum samples obtained from five CF patients were evaluated for the 
presence of small colony variant bacteria. . Two out of five sputum samples obtained 
from CF patients (patients 5 and 38) contained 100% rough small colony variants (Table 
10 1) that reverted to a wild-type colony morphology upon prolonged incubation on 

. antibiotic-free medium (Figure 3 A). Importantly, both samples 5 and 38 corresponded 
to patients that were undergoing antibiotic treatment at the time the samples were 
obtained (intravenous (IV) amikacin/ceftazidime for two days and oral (O) 
levofloxacin/inhaled (I) tobramycin for six weeks respectively Table 1). 



15 



TABLE 1 



Sample 5 Sample 38 Sample 41 Sample 42 Sample 43 

Antibiotic treatment of CF Amikacin(LV) Tobramycin (I) none none none 

patients Ceftazidime(TV) Levofloxacin(O) 

Small Colony variants in 100 100 <0.11 0.00 <0.12 
sputum sample (%) 

Variants resistant to amikacin 100 100 15 5 0.2 

(%) 

Variants resistant to gentamicin 100 100 10 6.6 0.5 



Variants resistant to 30 32 0 0 Not done 

tetracycline (%) 

Variants resistant to 50 100 0.10 0 0.5 

tobramycin (%) 
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Table 1 shows the presence of small colony P. aeruginosa variants in sputum 
samples from five CF patients. The presence of P. aeruginosa antibiotic resistant small 
colony variants was determined by plating CF sputum samples on cetrimide agar with 
and without the indicated antibiotics. 
5 Moreover, there was 29% enrichment in small colony variants in samples taken 

on two consecutive days from the patient that was undergoing intravenous antibiotic 
treatment. 

As shown in Table 1, 30-100% of the small colony variants present in samples 5 
and 38 were resistant to four different antibiotics (amikacin, gentamicin, tetracycline, 

10 and tobramycin) at concentrations equal to or higher than the minimal bactericidal 
concentration (MBC) of their respective reverted colonies. The proportion of small 
colony variants present in the samples that showed resistance to amikacin, gentamicin, 
tetracycline, and tobramycin was analyzed by simultaneously plating the sputum 
samples in cetrimide agar with and without antibiotics. The data obtained were 

15 compared to MBCs of the reverted colonies for the antibiotics in which variants were 
obtained In vitro susceptibility (MBC) to the different antibiotics used during the assays 
was determined by a standard tube dilution procedure described by Bailey and Scott 
(Diagnostic Microbiology, 3 1 3-329, 1 974). 

Although the other three CF sputum samples (41, 42 and 43) appeared to contain 

20 either a small proportion or no detectable small colony variants when plated on 

antibiotic free media, they did contain a considerable number (0.5-15%) of antibiotic 
resistant variants (Table 1). This discrepancy was due to the fact that it took the small 
colony variants 36-40 hours to form visible colonies, at which time the fast growing 
wild-type bacteria present in the sputum samples had overgrown the antibiotic free 

25 plates. Resistant variants with small colony phenotypes obtained from plating CF 

isolates 42 and 43 on media containing tobramycin (a front-line antibiotic used for the 
treatment of P. aeruginosa infections) exhibited increased attachment to PVC plastic 
(Figure 3B). 
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Identification of the Phenotypic Variation Regulator Gene 

Phenotypic variation is a common mechanism in Gram-negative bacteria, and 
involyes changes in observable phenotypes produced by modifications in surface 
components such as fimbriae, flagella, outer membrane proteins, and 
5 lipopolysaccharides. In the mushroom pathogen P. tolaasii, Greewal et al. (J. BacterioL 
177:4658, 1995) identified a two-component regulatory element responsible for the 
phenotypic switch from smooth to rough phenotype that involved changes in colony 
morphology and motility. Since the phenotype displayed by PAH RSCV was transient 
and involved alterations in surface properties, we hypothesized that a regulatory 

10 component was also responsible for the phenotypic switch observed in PAH. 

To identify this component, a genomic library of strain PA 14 constructed in the 
cosmid vector pJSRl (Rahme et al., Science 268:1899, 1995) was mobilized in masse 
into PAH RSCV by triparental mating using helper strain pRK2013 (Figurski et al., 
Proc. Natl. Acad. Sci. USA 76:1648, 1979). The resulting transconjugants were 

15 screened visually for colonies showing wild- type size and morphology (smooth colony 
phenotype). Individual transconjugants that showed wild-type characteristics were used 
to isolate the corresponding cosmids which were then reintroduced into PAH RSCV to 
confirm the reversion of the phenotype. Moreover, cosmid DNA from the 
transconjugants was digested to completion with the restriction enzymes EcoRI, PstI, 

20 and Hindlll and separated by electrophoresis on a 0.7% agarose gel. 

A total of 2,500 transconjugants were screened for colonies displaying wild-type 
PAH colony size and morphology. Two transconjugants that showed wild-type 
phenotypes were isolated, indicating that the inserts contained in the cosmids were able 
to induce reversion from small colony variant to wild-type phenotype. Two cosmid 

25 clones were isolated and reintroduced in PAH RSCV to test for restoration of wild- type 
phenotype, and both clones were found to be capable of greatly enhancing the rate of 
PAH RSCV reversion to the wild-type phenotype. Restriction digest profiles obtained 
with EcoRI, PstI, and Hindlll restriction enzymes showed the presence of a cosmid with 
the same insert in both cases, which was designated pED20. Although the PAH RSCV 

30 phenotype was normally very stable in liquid culture (i.e., no wild-type revertants 
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observed when an overnight culture was plated on LB agar), the majority of the cells in a 
PA14 RSCV culture carrying pED20 formed wild-type colonies after overnight 
incubation. 

Cosmid pED20 was then subcloned into the pUCP19 plasmid vector using a PstI 
5 restriction digest. The clones obtained after transformation in E. coli were used to 
isolate plasmid DNA that was subsequently introduced into PA14 RSCV by 
electroporation. The resulting clones were screened visually for colonies showing wild- 
type size and morphology. Subcloning of pED20 produced pED202, which contained a 
3.5-kb fragment, that restored the colony phenotype of PA14 RSCV variant to wild-type. 

10 Clone pED202 restored attachment phenotypes (Figure 4A), as well as the colony 
morphology and autoagglutination phenotypes of PA 14 RSCV variants to wild-type. 
The vector alone did not have any effect on the phenotypes analyzed. 

DNA sequencing and sequence analysis of the pED202 insert was then 
performed. The DNA fragments used for sequencing were PCR amplified initially using 

15 primers M13 and M13 reverse from the pUCP19 plasmid. Primers were later 

synthesized based on the sequencing data obtained. Sequencing data were analyzed 
using the DNAStar software (DNASTAR Inc., Madison, WI) to predict the open reading 
frames present in the pED202 3.5 kb insert. Sequence information was also compared 
with the sequence databases at the National Center for Biotechnology Information as 

20 well as to the P. aeruginosa PAOl sequence generated by the P. aeruginosa genome 
project (Cystic Fibrosis Foundation and PathoGenesis Corporation). 

Analysis of the sequencing data obtained from clone pED202 showed that the 
clone contained only one intact open reading frame. The nucleotide and predicted 
amino acid sequences of the ORF (designated pvrR for phenotype variant regulator) 

25 contained in clone pED202 were compared to the GenBank databases, and showed 
sequence similarities to response regulator elements of the two-component regulatory 
system. The search revealed 30% identity and 45% similarity in a 376 amino acid 
overlap to the Vibrio cholerae response regulator VieA, which is induced during 
intestinal infection in mouse. In addition, the ORF on pED202 showed 29% identity and 

30 45% similarity to a probable two-component response regulator identified in P. 
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aeruginosa strain PAOl (ORF PA3947). Interestingly, the region of the PAH genome 
containing pvrR is not present in the fully sequence P. aeruginosa strain PAOl . 

A homology search against domain sequences in the ProDom database (ProDom 
web site; http://prodes.Toulouse.inra.fr/prodom) identified 4 regions with high-scoring 
5 segment pairs in PvrR (Figure 4B). All 4 domains are also present in Vie A and the 
PAOl putative response regulator (Figure 4B). Moreover, these 4 domains exhibit high 
levels of amino acid sequence similarity (30%-60%; Figure 4B). Sequence analysis of 
the regions located upstream and downstream of pvrR revealed the presence of two 
additional ORFs (designated 0RF1 and 0RF3 respectively; Figure 4C) with sequence 

1 0 homology to two-component regulatory elements . 

The protein encoded by 0RF1 has homology to probable sensor/response 
regulator hybrids from P. aeruginosa (35% identity and 49% similarity to ORF. 
PA2824), to the sensor protein RcsC (capsular synthesis regulator component C) from 
Salmonella enterica subsp. enterica serovar Typhi (30% identity and 51% similarity) 

15 and to a two-component sensor regulator (PheN) that modulates phenotypic switching in 
P. tolaasii t (31% identity and 45% similarity). The protein encoded by 0RF3 shows 
42% identity and 60% similarity to the GacS sensor kinase from P.fluorescens, and 
41% identity and 59% similarity to the two-component sensor regulator that modulates 
phenotypic switching in P. tolaasii (PheN). 

20 Figure 5G shows a nucleic acid sequence (SEQ ID NO:7) including 

polynucleotides identified in the ORF1 region (SEQ ID NOS:3, and 8-18),/?vri? (SEQ 
ID NO: 1), polynucleotides identified in the ORF3 region (SEQ ID NOS:5, and 30-34), 
and the intergenic regions. The start and stop codons for each open reading frame are 
indicated by highlighting. Figures 5B and 6A-K show the nucleotide sequences of 

25 several open reading frames identified in the ORF1 region. The deduced amino acid 

sequence of these open reading frames are shown in Figures 5E (SEQ ID NO:4) and 6L- 
6V (SEQ ID NOS: 19-29). 

Additionally, Figure 5C shows the nucleic acid sequence (SEQ ID NO:5) of one 
of several open reading frames identified in the ORF3 region. The deduced amino acid 

30 sequence of the polypeptide encoded by this nucleotide sequence is shown in Figure 5F 
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(SEQ ID NO:6). Figures 7A-7E (SEQ ID NOS:30-34) show the nucleotide sequences of 
several additional open reading frames identified in the ORF3 region. The deduced 
amino acid sequence of the polypeptides encoded by these nucleotide sequences are 
shown in Figures 7F-7J. 
5 To determine whether pvrR or a highly similar pvrR homolog was present in the 

other P. aeruginosa strains, PCR analysis of 14 P. aeruginosa strains was performed 
using jwJJ-specific primers. The specificity of the PCR products obtained was 
subsequently confirmed by Southern blotting and hybridization with a j?vriJ-specific 
probe. Results showed that 7 out of 7 CF isolates, 2 out of 3 clinical isolates and 3 out 
10 of 4 standard P. aei-uginosa laboratory strains contained the pvrR gene fragment or a 
highly similar fragment (data not shown). 



PvrR Overexpression 

Consistent with the putative role of PvrR in the regulation of phenotypic 

1 5 switching, overexpression of PvrR from pED202 resulted in a 6-fold reduction in the 
frequency of resistant variants obtained after plating overnight cultures on kanamycin 
(200 Hg/ml) plates compared to wild-type (Figure 4D). Plasmid pED202, containing the 
pvrR gene was introduced into wild-type PA14 by electoroporation using standard 
methods. Frequency of appearance of kanamycin resistant variants and attachment to 

20 96-well PVC plates was assayed as described above. Interestingly, the PvrR 

overexpressing strain also caused a 2.5-fold reduction in attachment to PVC plastic with 
respect to the strain carrying the vector alone (Figure 4E). 

. pvrR Deletion Analysis 

25 Since PvrR is involved in the regulation of the phenotypic switch from wild- type 

to phenotypic variant, a mutation in pvrR would be expected to alter the proportion of 
resistant variants present in the PA14 population. To test this hypothesis, a 914 bp in- 
frame deletion within pvrR (denoted "A pvrR") was generated by replacing 2.33 kb of 
the wild-type sequence of the pvrR gene with a 1 .416 kb fragment amplified by PCR. 

30 The PCR-amplified DNA fragment was subcloned into the Xbal and Smal restriction 
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sites of the positive selection suicide vector pCVD442 to generate pED167. Plasmid 
pED167 was then used in an allelic exchange procedure to introduce the fragment 
containing the deleted copy of pvrR into the homologous region of the PAH 
chromosome, creating strain ED78. The deletion was confirmed by sequencing a PGR 
5 fragment containing pvrR. 

This deletion of pvrR (kpvrR) in PAH resulted in an increased frequency of 
appearance of resistant variants on kanamycin plates with respect to the wild-type 
(Figure 4F), confirming the involvement of pvrR in the regulation of phenotypic 
switching. The observation that 100% of the variants expressing wild-type pvrR 
10 reverted to the wild-type phenotype implicates PvrR is inducing reversion from variant 
to wild-type phenotypes. These results indicated that PvrR may be acting upstream of 
the switch, since inactivation of pvrR by mutation was not found to result in conversion 
to the variant type. 

15 Isolation of Additional Biofilm Regulator Genes 

Based on the nucleotide and amino acid sequences described herein, the isolation 
and identification of additional coding sequences of genes regulating the formation of 
microbial biofilm is made possible using standard strategies and techniques that are well 
known in the art. For example, any microbe that possesses the ability to form a biofilm 

20 can serve as the nucleic acid source for the molecular cloning of such a gene, and these 
sequences are identified as ones encoding a protein exhibiting structures, properties, or 
activities associated with biofilm formation, such as the PvrR (Figure 5D, SEQ ID 
NO:2), or any of the polynucleotides identified in the ORF1 (SEQ ID NOS:3 and 8-18) 
and ORF3 (SEQ ID NOS:5 and 30-34) regions. 

25 In one particular example of such an isolation technique, any one of the 

nucleotide sequences described herein, including pvrR (Figure 5A, SEQ ID NO:l), 
ORF1 (Figure 5B, SEQ ID NO:3), or ORF3 (Figure SC, SEQ ID NO:5) may be used, 
together with conventional methods of nucleic acid hybridization screening. Such 
hybridization techniques and screening procedures are well known to those skilled in the 

30 art and are described, for example, in Benton and Davis (Science 196:180, 1977); 
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Grunstein and Hogness (Proc. Natl Acad, ScL, USA 72:3961, 1975); Ausubel et al. 
(Current Protocols in Molecular Biology, Wiley Interscience, New York, 2001); Berger 
and Kimmel (Guide to Molecular Cloning Techniques, 1987, Academic Press, New 
York); and Sambrook et aL, Molecular Cloning: A Laboratory Manual, Cold Spring 
5 Harbor Laboratory Press, New York. In one particular example, all or part of the pvrR, 
ORF1, or ORF3 sequences (described herein) may be used as a probe to screen a 
recombinant DNA library for genes having sequence identity to the pvrR 9 ORF1, or 
ORFS genes. Hybridizing sequences are detected by plaque or colony hybridization 
according to standard methods. 

10 Alternatively, using all or a portion of the amino acid sequences of PvrR, ORF1 , 

or ORF3, one may readily design pvrR, ORF1, or ORF3 gene-specific oligonucleotide 
probes, including degenerate oligonucleotide probes (i.e., a mixture of all possible 
coding sequences for a given amino acid sequence). These oligonucleotides may be 
based upon the sequence of either DNA strand and any appropriate portion of the pvrR, 

15 ORF1, or ORF3 sequences. General methods for designing and preparing such probes 
are provided, for example, in Ausubel et al. (supra), and Berger and Kimmel, Guide to 
Molecidar Cloning Techniques, 1987, Academic Press, New York. These 
oligonucleotides are useful for pvrR, ORF1, or ORFi gene isolation, either through their 
use as probes capable of hybridizing to pvrR, ORFI, or ORFi complementary sequences 

20 or as primers for various amplification techniques, for example, polymerase chain 
reaction (PCR) cloning strategies. If desired, a combination of different, detectably- 
labelled oligonucleotide probes may be used for the . screening of a recombinant DNA 
library. Such libraries are prepared according to methods well known in the art, for 
example, as described in Ausubel et al. (supra), or they may be obtained from 

25 commercial sources. 

As discussed above, sequence-specific oligonucleotides may also be used as 
primers in amplification cloning strategies, for example, using PCR. PCR methods are 
well known in the art and are described, for example, in PCR Technology, Erlich, ed., 
Stockton Press, London, 1989; PCR Protocols: A Guide to Methods and Applications, 

30 Innis et al., eds., Academic Press, Inc., New York, 1990; and Ausubel et al. (supra). 



.44- 



WO 03/004689 PCT/US02/21431 

Primers are optionally designed to allow cloning of the amplified product into a suitable 
vector, for example, by including appropriate restriction sites at the 5' and 3* ends of the 
amplified fragment (as described herein). If desired, nucleotide sequences may be . 
isolated using the PCR "RACE" technique, or Rapid Amplification of cDNA Ends (see, 
5 e.g., Innis et al. (supra)). By this method, oligonucleotide primers based on a desired 
sequence are oriented in the 3' and 5' directions and are used to generate overlapping 
PCR fragments. These overlapping 3 - and 5 ! -end RACE products are combined to 
produce an intact full-length cDNA. This method is described in Innis et al. (supra); and 
Frohman et al., Proc. Natl Acad Sci. USA 85:8998, 1988. 

10 Partial sequences, e.g., sequence tags, are also useful as hybridization probes for 

identifying ftdl-length sequences, as well as for screening databases for identifying 
previously unidentified related virulence genes. 

In general, the invention includes any nucleic acid sequence which may be 
isolated as described herein or which is readily isolated by homology screening or PCR 

1 5 amplification using any of the nucleic acid sequences disclosed herein such as those 
shown in Figures 5A, 5C, 5G, 6A-K, or 7A-7E. 

It will be appreciated by those skilled in the art that as a result of the degeneracy 
of the genetic code, a multitude of polynucleotide sequences encoding PvrR, ORF1, or 
ORF3, some bearing minimal similarity to the polynucleotide sequences of any known 

20 and naturally occurring gene, may be produced. Thus, the invention contemplates each 
and every possible variation of polynucleotide sequence that could be made by selecting 
combinations based on possible codon choices. These combinations are made in 
accordance with the standard triplet genetic code as applied to the polynucleotide 
sequence of naturally-occurring /?vri?, ORF1, or ORF3 9 and all such variations are to be 

25 considered as being specifically disclosed. 

Although nucleotide sequences which encode PvrR, ORF1, ORF3, or their 
variants are preferably capable of hybridizing to the nucleotide sequence of the 
naturally-occurring pvrR, ORF1, or ORF3 under appropriately selected conditions of 
stringency, it may be advantageous to produce nucleotide sequences encoding PvrR, 

30 ORF1, ORF3, or their derivatives possessing a substantially different codon usage, e.g., 
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inclusion of non-naturally occurring codons. Codons may be selected to increase the 
rate at which expression of the peptide occurs in a particular prokaryotic or eukaryotic 
host in accordance with the frequency with which particular codons are utilized by the 
host. Other reasons for substantially altering the nucleotide sequence encoding PvrR, 
5 ORF1 , ORF3, and their derivatives without altering the encoded amino acid sequences 
include the production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode 
PvrR, ORF1, ORF3, or fragments thereof generated entirely by synthetic chemistry.' 

10 After production, the synthetic sequence may be inserted into any of the many available 
expression vectors and cell systems using reagents well known in the art. Moreover, 
synthetic chemistry may be used to introduce mutations into a sequence encoding any 
one of PvrR, ORF1 , ORF3, or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are 

15 capable of hybridizing to the claimed polynucleotide sequences, and, in particular, to 
those shown in Figure 5A, 5B, 5C, 5G, 6A-6K, or.7A-7E and fragments thereof under 
various conditions of stringency. (See, e.g., Wahl, G. M. and S. L. Berger (1987) 
Methods Enzymol. 152:399; Kimmel, A. R. (1987) Methods EnzymoL 152:507) For 
example, stringent salt concentration will ordinarily be less than about 750 mM NaCl 

20 and 75 mM trisodium citrate, preferably less than about 500 mM NaCl and 50 mM 
trisodium citrate, and most preferably less than about 250 mM NaCl and 25 mM 
trisodium citrate. Low stringency hybridization can be obtained in the absence of 
organic solvent, e.g., formamide, while high stringency hybridization can be obtained in 
the presence of at least about 35% formamide, and most preferably at least about 50% 

25 formamide. Stringent temperature conditions will ordinarily include temperatures of at 
least about 30 °C, more preferably of at least about 37 °C, and most preferably of at least 
about 42 °C. Varying additional parameters, such as hybridization time, the 
concentration of detergent, e.g., sodium dodecyl sulfate (SDS), and the inclusion or 
exclusion of carrier DNA, are well known to those skilled in the art. Various levels of 

30 stringency are accomplished by combining these various conditions as needed. In a 
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preferred embodiment, hybridization will occur at 30 °C in 750 mM NaCl, 75 mM 
trisodium citrate, and 1% SDS. In a more preferred embodiment, hybridization will 
occur at 37 °C in 500 mM NaCl, 50 mM trisodium citrate, 1% SDS, 35% formamide, 
and 100 |ig/ml denatured salmon sperm DNA (ssDNA). In a most preferred 
5 embodiment, hybridization will occur at 42 °C in 250 mM NaCl, 25 mM trisodium 
citrate, 1% SDS, 50% formamide, and 200 jig/ml ssDNA. Useful variations on these 
conditions will be readily apparent to those skilled in the art. 

The washing steps which follow hybridization can also vary in stringency. Wash 
stringency conditions can be defined by salt concentration and by temperature. As 

10 above, wash stringency can be increased by decreasing salt concentration or by 

increasing temperature. For example, stringent salt concentration for the wash steps will 
preferably be less than about 30 mM NaCl and 3 mM trisodium citrate, and most 
preferably less than about 15 mM NaCl and 1.5 mM trisodium citrate. Stringent 
temperature conditions for the wash steps will ordinarily include temperature of at least 

15 about 25 °C, more preferably of at least about 42 °C, and most preferably of at least 

about 68 °C In a preferred embodiment, wash steps will occur at 25°C in 30 mM NaCl, 
3 mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment, wash steps 
will occur at 42°C in 15 mM NaCl, 1 .5 mM trisodium citrate, and 0. 1% SDS. In a most 
preferred embodiment, wash steps will occur at 68°C in 15 mM NaCl, 1.5 mM trisodium 

20 citrate, and 0.1% SDS. Additional variations on these conditions will be readily 
apparent to those skilled in the art. 

Methods for DNA sequencing are well known in the art and may be used to 
practice any of the embodiments of the invention. The resulting sequences are analyzed 
using a variety of algorithms which are well known in the art. (See, e.g., Ausubel, F. M. 

25 (1 997) Short Protocols in Molecular Biology, John Wiley & Sons, New York N. Y., unit 
7.7) 

Polypeptide Expression 

In general, polypeptides of the invention (e.g., PvrR, ORF1 , or ORF3 as shown 
30 in Figures 5D, 5E, 5F, 6L-6V, or 7F-7 J) may be produced by transformation of a 
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suitable host cell with all or part of a polypeptide-encoding nucleic acid molecule or 
fragment thereof in a suitable expression vehicle. 

Those skilled in the field of molecular biology will understand that any of a wide 
variety of expression systems may be used to provide the recombinant protein. The 
5 precise host cell used is not critical to the invention. A polypeptide of the invention may 
be produced in a prokaryotic host (e.g., E. colt) or in a eukaryotic host (e.g., 
Saccharomyces cerevisiae, insect cells, e.g., Sf21 cells, or mammalian cells, e.g., NIH 
3T3, HeLa, or preferably COS cells). Such cells are available from a wide range of 
sources (e.g., the American Type Culture Collection, Rockland, MD; also, see, e.g., 

10 Ausubel et al., supra). The method of transformation or transfection and the choice of 
expression vehicle will depend on the host system selected. Transformation and 
transfection methods are described, e.g., in Ausubel et al. (supra); expression vehicles 
may be chosen from those provided, e.g., in Cloning Vectors: A Laboratory Manual 
(P.H. Pouwels et al., 1 985, Supp. 1 987). 

1 5 One particular bacterial expression system for polypeptide production is the E. 

coli pET expression system (Novagen, Inc., Madison, WI). According to this expression 
system, DNA encoding a polypeptide is inserted into a pET vector in an orientation 
designed to allow expression. Since the gene encoding such a polypeptide is under the 
control of the T7 regulatory signals, expression of the polypeptide is achieved by 

20 inducing the expression of T7RN A polymerase in the host cell. This is typically 
achieved using host strains which express T7 RNA polymerase in response to IPTG 
induction. Once produced, recombinant polypeptide is then isolated according to 
standard methods known in the art, for example, those described herein. 

Another bacterial expression system for polypeptide production is the pGEX 

25 expression system (Pharmacia). This system employs a GST gene fusion system which 
is designed for high-level expression of genes or gene fragments as fusion proteins with 
rapid purification and recovery of functional gene products. The protein of interest is 
fused to the carboxyl terminus of the glutathione S-transferase protein from Schistosoma 
japonicum and is readily purified from bacterial lysates by affinity chromatography 

30 using Glutathione Sepharose 4B. Fusion proteins can be recovered under mild 
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conditions by elution with glutathione. Cleavage of the glutathione S-transferase 
domain from the fusion protein is facilitated by the presence of recognition sites for site- 
specific proteases upstream of this domain. For example, proteins expressed in pGEX- 
2T plasmids may be cleaved with thrombin; those expressed in pGEX-3X may be 
5 cleaved with factor Xa. 

Once the recombinant polypeptide of the invention is expressed, it is isolated, 
e.g., using affinity chromatography. In one example, an antibody (e.g., produced as 
described herein) raised against a polypeptide of the invention may be attached to a 
column and used to isolate the recombinant polypeptide. Lysis and fractionation of 

10 polypeptide-harboring cells prior to affinity chromatography may be performed by 
standard methods (see, e.g., Ausubel et al., supra). 

Once isolated, the recombinant protein can, if desired, be further purified, e.g., 
by high performance liquid chromatography (see, e.g., Fisher, Laboratory Techniques In 
Biochemistry And Molecular Biology, eds., Work and Burdon, Elsevier, 1980). 

15 Polypeptides of the invention, particularly short peptide fragments, can also be 

produced by chemical synthesis (e.g., by the methods described in Solid Phase Peptide 
Synthesis, 2nd ed., 1984 The Pierce Chemical Co., Rockford, IL), Also included in the 
invention are polypeptides which are modified in ways which do not abolish their 
pathogenic activity (assayed, for example as described herein). Such changes may 

20 include certain mutations, deletions, insertions, or post-translational modifications, or 
may involve the inclusion of any of the polypeptides of the invention as one component 
of a larger fusion protein. 

The invention further includes analogs of any naturally-occurring polypeptide of 
the invention. Analogs can differ from the naturally-occurring the polypeptide of the 

25 invention by amino acid sequence differences, by post-translational modifications, or by 
both. Analogs of the invention will generally exhibit at least 85%, more preferably 90%, 
and most preferably 95% or even 99% identity with all or part of a naturally-occurring 
amino acid sequence of the invention. The length of sequence comparison is at least 15 
amino acid residues, preferably at least 25 amino acid residues, and more preferably 

30 more than 35 amino acid residues. Again, in an exemplary approach to determining the 
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degree of identity, a BLAST program may be used, with a probability score between e" 3 
and e" 100 indicating a closely related sequence. Modifications include in vivo and in vitro 
chemical derivatization of polypeptides, e.g., acetylation, carboxylation, 
phosphorylation, or glycosylation; such modifications may occur during polypeptide 
5 synthesis or processing or following treatment with isolated modifying enzymes. 
Analogs can also differ from the naturally-occurring polypeptides of the invention by 
alterations in primary sequence. These include genetic variants, both natural and 
induced (for example, resulting from random mutagenesis by irradiation or exposure to 
ethanemethylsulfate or by site-specific mutagenesis as described in Sambrook, Fritsch 

10 and Maniatis, Molecular Cloning: A Laboratoiy Manual (2d ed.), CSH Press, 1989, or 
Ausubel et al, supra). Also included are cyclized peptides, molecules, and analogs 
which contain residues other than L-amino acids, e.g., D-amino acids or non-naturally 
occurring or synthetic amino acids, e.g., |3 or y amino acids. 

In addition to full-length polypeptides, the invention also includes fragments of 

1 5 any one of the polypeptides of the invention. As used herein, the term "fragment," 
means at least 5, preferably at least 20 contiguous amino acids, preferably at least 30 
contiguous amino acids, more preferably at least 50 contiguous amino acids, and most 
preferably at least 60 to 80 or more contiguous amino acids. Fragments of the invention 
can be generated by methods known to those skilled in the art or may result from normal 

20 protein processing (e.g., removal of amino acids from the nascent polypeptide that are 
not required for biological activity or removal of amino acids by alternative mRNA 
splicing or alternative protein processing events). The aforementioned general 
techniques of polypeptide expression and purification can also be used to produce and 
isolate useful peptide fragments or analogs (described herein). 

25 

Antibodies 

The polypeptides disclosed herein or variants thereof or cells expressing them 
can be used as an immunogen to produce antibodies immunospecific for such 
polypeptides. "Antibodies" as used herein include monoclonal and polyclonal 
30 antibodies, chimeric, single chain, simianized antibodies and humanized antibodies, as 
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well as Fab fragments, including the products of an Fab immunoglobulin expression 
library. 

To generate antibodies, a coding sequence for a polypeptide of the invention may 
be expressed as a C-terminal fusion with glutathione S-transferase (GST) (Smith et aL, 
5 Gene 67:3 1 , 1988). The fusion protein is purified on glutathione-Sepharose beads, 
eluted with glutathione, cleaved with thrombin (at the engineered cleavage site), and 
purified to the degree necessary for immunization of rabbits. Primary immunizations are 
carried out with Freund's complete adjuvant and subsequent immunizations with 
Freund's incomplete adjuvant. Antibody titres are monitored by Western blot and 

1 0 immunoprecipitation analyses using the thrombin-cleaved protein fragment of the GST 
fusion protein. Immune sera are affinity purified using CNBr-Sepharose-coupled 
protein. Antiserum specificity is determined using a panel of unrelated GST proteins. 

As an alternate or adjunct immunogen to GST fusion proteins, peptides 
corresponding to relatively unique immunogenic regions of a polypeptide of the 

1 5 invention may be generated and coupled to keyhole limpet hemocyanin (KLH) through 
an introduced C-terminal lysine. Antiserum to each of these peptides is similarly 
affinity purified on peptides conjugated to BSA, and specificity tested in ELISA and 
Western blots using peptide conjugates, and by Western blot and immunoprecipitation 
using the polypeptide expressed as a GST fusion protein. 

20 Alternatively, monoclonal antibodies which specifically bind any one of the 

polypeptides of the invention are prepared according to. standard hybridoma technology 
(see, e.g., Kohler et al., Nature 256:495, 1975; Kohler et al., Eur. J. Immunol 6:51 1, 
1976; Kohler et al., Eur. J. Immunol 6:292, 1976; Hammerling et al., In Monoclonal 
Antibodies and T Cell Hybridomas, Elsevier, NY, 1981; Ausubel et al., supra) . Once 

25 produced, monoclonal antibodies are also tested for specific recognition by Western blot 
or immunoprecipitation analysis (by the methods described in Ausubel et al., supra) . 
Antibodies which specifically recognize the polypeptide of the invention are considered 
to be useful in the invention; such antibodies may be used, e.g., in an immunoassay. 
Alternatively monoclonal antibodies may be prepared using the polypeptide of the 
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invention described above and a phage display library (Vaughan et al., Nature Biotech 
14:309,1996). 

Preferably, antibodies of the invention are produced using fragments of the 
polypeptides disclosed herein which lie outside generally conserved regions and appear 
5 likely to be antigenic, by criteria such as high frequency of charged residues. In one 
specific example, such fragments are generated by standard techniques of PCR and 
cloned into the pGEX expression vector (Ausubel et al., supra). Fusion proteins are 
expressed in E. coli and purified using a glutathione agarose affinity matrix as described 
in Ausubel et al. (supra). To attempt to minimize the potential problems of low affinity 

10 or specificity of antisera, two or three such fusions are generated for each protein, and 
each fusion is injected into at least two rabbits. Antisera are raised by injections in a 
series, preferably including at least three booster injections. 

Antibodies against any of the polypeptides described herein may be employed to 
treat bacterial infections, for example, those infections involving biofilm formation. 

15 Thus, among others, antibodies against, for example, polypeptides of PvrR (SEQ ID 
NO: 2), ORF1 (SEQ ID NO: 4), or ORF3 (SEQ ID NO: 6) shown respectively in 
Figures 5D, E, or F may be employed to treat infections, particularly bacterial infections 
and especially chronic infections associated with CF or biofilm formation associated 
with indwelling medical devices, conjunctivitis, pneumonia, and bacteremia. 

20 

Diagnostics 

In another embodiment, antibodies which specifically bind any of the 
polypeptides described herein may be used for the diagnosis of bacterial infection. A 
variety of protocols for measuring such polypeptides, including ELISAs, RIAs, and 
25 FACS, are known in the art and provide a basis for diagnosing bacterial infections. 

In another aspect, hybridization with PCR probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding pvrR, ORF1, ORF3, 
or closely related molecules may be used to identify nucleic acid sequences which 
encode its gene product. The specificity of the probe, whether it is made from a highly 
30 specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
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conserved motif, and the stringency of the hybridization or amplification (maximal, 
high, intermediate, or low), will determine whether the probe identifies only naturally 
occurring sequences encoding PvrR, ORF1, or ORF3 allelic variants, or related 
sequences. 

5 In further embodiments, oligonucleotides or longer fragments derived from any 

of the polynucleotide sequences described herein may be used as targets in a microarray. 
The microarray can be used to monitor the expression level of large numbers of genes 
simultaneously and to identify genetic variants, mutations, and polymorphisms. This 
. information may be used to determine gene function, to understand the genetic basis of a 

10 disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic 
agents. Microarrays may be prepared, used, and analyzed using methods known in the 
art. (See, e.g., Brennan et al., U.S. Pat. No. 5,474,796; Schena et al, Proc. Natl Acad. 
Set 93:10614, 1996; Baldeschweiler et al., PCT application W095/251 116, 1995; 
Shalon, D. et al., PCT application WO95/35505, 1995; Heller et al, Proc. Natl Acad. 

15 Sci. 94:2150, 1997; and Heller et al., U.S. Pat. No. 5,605,662.) 

Screening Assays 

As discussed above, we have identified a biofilm regulator gene,/?wi?, of P. 
aeruginosa that mediates biofilm formation and antibiotic resistance by a microbe. 
20 Based on this discovery, we have developed screening assays for identifying compounds 
that enhance or inhibit the action of a polypeptide or the expression of a nucleic acid 
sequence of the invention. The method of screening may involve high-throughput 
techniques. 

Any number of methods are available for carrying out such screening assays. In 
25 one working example, candidate compounds are added at varying concentrations to the 
culture medium of pathogenic cells expressing one of the nucleic acid sequences of the 
invention. Gene expression is then measured, for example, by standard Northern blot 
analysis (Ausubel et al., supra) or RT-PCR, using any appropriate fragment prepared 
from the nucleic acid molecule as a hybridization probe. The level of gene expression in 
30 the presence of the candidate compound is compared to the level measured in a control 
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culture medium lacking the candidate molecule. A compound which promotes an 
increase in the expression of the pvrR gene or functional equivalent is considered useful 
in the invention; such a molecule may be used, for example, as a therapeutic to combat 
the pathogenicity of an infectious organism, for example, by decreasing its ability to 
5 form a biofilm and rendering it susceptible to antibiotic treatment. 

In another working example, the effect of candidate compounds may be 
measured at the level of polypeptide production using the same general approach and 
standard immunological techniques, such as Western blotting or immunoprecipitation 
with an antibody specific for a biofilm regulator polypeptide, such as PvrR. For 

10 example, immunoassays may be used to detect or monitor the expression of at least one 
of the polypeptides of the invention in a microbial organism. Polyclonal or monoclonal 
antibodies (produced as described above) which are capable of binding to such a 
polypeptide maybe used in any standard immunoassay format (e.g., ELISA, Western 
blot, or RIA assay) to measure the level of the polypeptide. A compound which 

1 5 promotes an increase in the expression of the polypeptide is considered particularly 

useful. Again, such a molecule may be used, for example, as a therapeutic to combat the 
biofilm formation of an organism as is described above. 

In yet another working example, candidate compounds may be screened for 
those which specifically bind to and agonize a PvrR polypeptide (a polypeptide having 

20 the amino acid sequences shown in Figure 5D) of the invention. The efficacy of such a 
candidate compound is dependent upon its ability to interact with the PvrR polypeptide 
or functional equivalent thereof. Such an interaction can be readily assayed using any 
number of standard binding techniques and functional assays (e.g., those described in 
Ausubel et al., supra). For example, a candidate compound may be tested in vitro for 

25 interaction and binding with a polypeptide of the invention and its ability to modulate 
biofilm formation may be assayed by any standard assay (e.g., those described herein). 

In one particular working example, a candidate compound that binds to a 
polypeptide (e.g, PvrR) may be identified using a chromatography-based technique. For 
example, a recombinant polypeptide of the invention may be purified by standard 

30 techniques from cells engineered to express the polypeptide (e.g., those described above) 
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and may be immobilized on a column. A solution of candidate compounds is then 
passed through the column, and a compound specific for the pathogenicity polypeptide 
(e.g, biofilm regulator polypeptide) is identified on the basis of its ability to bind to the 
pathogenicity polypeptide (e.g, biofilm regulator polypeptide) and be immobilized on 
5 the column. To isolate the compound, the column is washed to remove non-specifically 
bound molecules, and the compound of interest is then released from the column and 
collected. Compounds isolated by this method (or any other appropriate method) may, if 
desired, be further purified (e.g., by high performance liquid chromatography). In 
addition, these candidate compounds may be tested for their ability to render a pathogen 

10 incapable of forming a biofilm (e.g., as described herein). Compounds isolated by this 
approach may also be used, for example, as therapeutics to treat or prevent the onset of a 
pathogenic infection, disease, or both. Compounds which are identified as binding to 
pathogenicity polypeptides (e.g, biofilm regulator polypeptides) with an affinity constant 
less than or equal to 10 mM are considered particularly useful in the invention. 

1 5 Potential agonists include organic molecules, peptides, peptide mimetics, 

polypeptides, and antibodies that bind to a nucleic acid sequence or polypeptide of the 
invention (e.g, biofilm regulator polypeptides) and thereby increase its activity. 
Potential agonists also include small molecules that bind to and occupy the binding site 
of the polypeptide thereby preventing binding to cellular binding molecules, such that 

20 normal biological activity is prevented. 

Compounds that decrease only antibiotic resistance of a microbe are also 
identified by monitoring reversion of bacterial colonies from the antibiotic resistant 
phenotype to the wild-type susceptible phenotype. In one working example, screens for 
compounds that increase reversion rate are conducted by plating antibiotic resistant 

25 variant bacteria on antibiotic-free media in the presence or absence of a candidate 
compound. The plates are then cultured using standard methods. The plates are then 
visually inspected for revertants, i.e., bacterial colonies having a wild-type phenotype. 
The number of wild-type phenotype bacterial colonies is compared between plates 
cultured in the presence or absence of a candidate compound. Compounds that increase 
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the number of wild-type revertants, relative to the number of wild-type revertants on a 
control plate without the compound, are taken as useful in the invention. 

Additionally, compounds that decrease antibiotic resistance are identified by 
monitoring for an increase in the susceptibility of bacteria to antibiotics. In yet another 
5 working example, compounds that decrease antibiotic resistance are identified by plating 
wild-type bacteria on antibiotic containing plates in the presence or absence of a 
candidate compound. The plates are cultured using standard methods, and then visually 
inspected for bacterial colonies. The number of antibiotic resistant bacterial colonies is 
compared between plates cultured in the presence or absence of a candidate compound. 

10 Compounds that decrease the number of antibiotic resistant variant colonies, relative to 
the number of antibiotic resistant variant colonies on a control plate without the 
compound, are taken as useful in the invention. 

In another working example, a gene that regulates biofilm formation is identified 
by monitoring its activity or activity of its encoded polypeptide, when mutated. Bacteria 

15 are mutagenized using standard methods, such as transposon mutagenesis. Mutagenized 
and wild-type bacteria are then plated on antibiotic containing plates. These plates are 
cultured using standard methods, and then are visually inspected for the presence of 
antibiotic resistant variant bacteria. The number of antibiotic resistant variant bacterial 
colonies (e.g., small colony variants) is compared between mutagenized bacterial plates 

20 and wild-type control plates. This comparison is typically conducted when variant 

colonies begin to appear on the wild-type plate. A decrease or increase in the number of 
antibiotic resistant variant bacterial colonies (e.g., small colony variants) on a plate 
containing mutagenized bacteria is taken as an indication of the presence of a genetic 
mutation in a gene that regulates biofilm formation. The mutated gene is then identified 

25 according to standard methods. 

• In yet another working example, a gene that regulates biofilm or phenotype- 
mediated antibiotic resistance is identified as follows. For example, a candidate gene 
(e.g., as part of a genomic library) is introduced into a variant host cell (e.g., 
Pseudomonas aeinginosa PA14 RSCV). Next, the transformed host cell is monitored 

30 for reversion from the rough small colony variant phenotype to wild-type. The plates 
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are then cultured using standard methods and monitored for the appearance of colonies 
with a wild-type phenotype. The number of wild-type phenotype bacterial colonies is 
then compared between plates containing transformants and variants carrying the vector 
alone. An increase in the number of wild-type revertants, relative to the number of wild- 
5 type revertants on a control plate with the vector alone, identifies a gene that regulates 
biofilm formation or phenotype-mediated antibiotic resistance. A gene identified using 
this method is subsequently isolated using standard procedures known in the art. 

In another working example, small colony phenotypic variants are plated on an 
appropriate antibiotic medium (for example, those described herein) in the presence of a 

10 candidate compound and reversion to wild-type is monitored. Compounds that promote 
reversion from PAH RSCV to wild-type are taken as being useful in the invention. 

In another working example, a gene that regulates or is involved in phenotype- 
mediated or biofilm-mediated antibiotic resistance or biofilm formation is identified as 
follows. Bacteria are mutagenized using standard methods, such as transposon 

1 5 mutagenesis. Mutagenized bacteria are then plated on Trypticase Soy Agar (TS A) plates 
containing antibiotic. These plates are cultured using standard methods, and then 
inspected for bacterial growth. A decrease in the number of bacterial colonies or their 
absence on a mutagenized plate, relative to a control plate containing non-mutagenized 
bacteria identifies the presence of a genetic mutation in a gene that regulates phenotype- 

20 mediated or biofilm-mediated antibiotic resistance and biofilm formation. A gene 

identified using this method is subsequently isolated using standard procedures known in 
the art. 

In another working example, a gene that regulates or is involved in phenotype- 
mediated or biofilm-mediated antibiotic resistance or biofilm formation is identified as 

25 follows. Bacteria are mutagenized using standard methods, such as transposon 

mutagenesis. Mutagenized bacteria are then transferred to Trypticase Soy Broth (TSB) 
liquid culture media containing an antibiotic. The bacteria are then cultured using 
standard methods, and the cultures are inspected for the presence of bacterial growth. 
Bacterial growth is. compared between mutagenized cultures and wild-type control 

30 cultures. Bacterial growth can be identified, for example, by visual inspection, by 
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measuring optical density at 600 nm, or by other standard methods. The inability of a 
mutant to grow in liquid culture with antibiotics indicates the presence of a genetic 
mutation in a gene that regulates or is involved in phenotype-mediated or biofilm- 
mediated antibiotic resistance and biofilm formation. A gene identified using this 
5 method is subsequently isolated using standard procedures known in the art. 

In another working example, a gene that regulates or is involved in phenotype- 
mediated or biofilm-mediated antibiotic resistance or biofilm formation is identified as 
follows. Bacteria are mutagenized using standard methods, such as transposon 
mutagenesis. Mutagenized bacteria are then plated on TSA plates containing antibiotic. 

1 0 These plates are cultured using standard methods, and then inspected for bacterial 

growth. The inability of a mutant to grow in TSA supplemented with antibiotics is taken 
as an indication of the presence of a genetic mutation in a gene that regulates or is 
involved in phenotype-mediated or biofilm-mediated resistance and biofilm formation. 
A gene identified using this method is subsequently isolated using standard procedures 

15 known in the art. 

In another working example, a gene that regulates or is involved in phenotype- 
mediated or biofilm-mediated antibiotic resistance or biofilm formation is identified as 
follows. Bacteria are mutagenized using standard methods, such as transposon 
mutagenesis. Mutagenized bacteria are then transferred to liquid culture media TSB 

20 containing an antibiotic. The bacteria are then cultured using standard methods, and the 
cultures are inspected for the presence of bacterial growth. Bacterial growth is 
compared between mutagenized cultures and wild-type control cultures. Bacterial 
growth can be identified, for example, by visual inspection, by measuring optical density 
at 600 nm, or by other standard methods. The inability of a mutant to grow in liquid 

25 culture with antibiotics indicates the presence of a genetic mutation in a gene that 

regulates or is involved in phenotype-mediated or biofilm-mediated antibiotic resistance 
and biofilm formation. A gene identified using this method is subsequently isolated 
using standard procedures known in the art. 

Each of the DNA sequences provided herein may also be used in the discovery 

30 and development of antipathogenic compounds (e.g., antibiotics). The encoded protein, 



-58- 



WO 03/004689 



PCT/US02/21431 



upon expression, can be used as a target for the screening of antibacterial drugs. 
Additionally, the DNA sequences encoding the amino terminal regions of the encoded 
protein or Shine-Delgarno or other translation facilitating sequences of the respective 
. mRNA can be used to construct antisense sequences to control the expression of the 
5 coding sequence of interest. 

The antagonists and agonists of the invention may be employed, for instance, to 
inhibit and treat a variety of bacterial infections, for example, those involving biofilm 
formation. 

Optionally, compounds identified in any of the above-described assays may be 
10 confirmed as useful in conferring protection against the development of a pathogenic 
infection in any standard animal model (e.g., the mouse-burn assay described herein) 
and, if successful, may be used as anti-pathogen therapeutics (e.g, antibiotics). 

Small molecules of the invention preferably have a molecular weight below 
2,000 daltons, more preferably between 300 and 1,000 daltons, and most preferably 
15 between 400 and 700 daltons. It is preferred that these small molecules are organic 
molecules. 

Test Compounds and Extracts 

In general, compounds capable of reducing pathogenic virulence (e.g., reducing 

20 biofilm formation) are identified from large libraries of both natural product or synthetic 
(or semi-synthetic) extracts or chemical libraries according to methods known in the art. 
Those skilled in the field of drug discovery and development will understand that the 
precise source of test extracts or compounds is not critical to the screening procedure(s) 
of the invention. Accordingly, virtually any number of chemical extracts or compounds 

25 can be screened using the methods described herein. Examples of such extracts or 

compounds include, but are not limited to, plant-, fungal-, prokaryotic- or animal-based 
extracts, fermentation broths, and synthetic compounds, as well as modification of 
existing compounds. Numerous methods are also available for generating random or 
directed synthesis (e.g., semi-synthesis or total synthesis) of any number of chemical 

30 compounds, including, but not limited to, saccharide-, lipid-, peptide-, and nucleic acid- 
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based compounds. Synthetic compound libraries are commercially available from 
Brandon Associates (Merrimack, NH) and Aldrich Chemical (Milwaukee, WI). 
Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant, and 
animal extracts are commercially available from a number of sources, including Biotics 
5 (Sussex, UK), Xenova (Slough, UK), Harbor Branch Oceangraphics Institute (Ft. Pierce, 
FL), and PharmaMar, U.S.A. (Cambridge, MA). In addition, natural and synthetically 
produced libraries are produced, if desired, according to methods known in the art, e.g., 
by standard extraction and fractionation methods. Furthermore, if desired, any library or 
compound is readily modified using standard chemical, physical, or biochemical 
10 methods. 

In addition, those skilled in the art of drug discovery and development readily 
understand that methods for dereplication (e.g., taxonomic dereplication, biological 
dereplication, and chemical dereplication, or any combination thereof) or the elimination 
of replicates or repeats of materials already known for their anti-pathogenic activity 

1 5 should be employed whenever possible. 

When a crude extract is found to have an anti-pathogenic or anti-virulence 
activity, or a binding activity, further fractionation of the positive lead extract is 
necessary to isolate chemical constituents responsible for the observed effect. Thus, the 
goal of the extraction, fractionation, and purification process is the careful 

20 characterization and identification of a chemical entity within the crude extract having 
anti-pathogenic activity. Methods of fractionation and purification of such heterogenous 
extracts are known in the art. If desired, compounds shown to be useful agents for the 
treatment of pathogenicity are chemically modified according to methods known in the 
art. 

25 

Pharmaceutical Therapeutics 

The invention provides a simple means for identifying compounds (including 
peptides, small molecule inhibitors, and mimetics) capable of inhibiting the 
pathogenicity (e.g., biofilm formation) of a pathogen. Accordingly, a chemical entity 
30 discovered to have medicinal value using the methods described herein is useful as a 
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drug or as information for structural modification of existing anti-pathogenic 
compounds, e.g., by rational drug design. Such methods are useful for screening 
compounds having an effect on a variety of pathogens that form biofilms including, but 
not limited to, bacteria. Examples of pathogenic bacteria include, without limitation, 
5 Aerobacter, Aeromonas, Acinetobacter, Agrobacterium, Bacillus, Bacteroides, 
Bartonella, Bortella, Brucella, Calymmatobacterium, Campylobacter, Citrobacter, 
Clostridium, Cornyebacterium, Enterobacter, Enterococcus, Escherichia, Francisella, 
Haemophilus, Hafnia, Helicobacter, Klebsiella, Legionella, Listeria, Morganella, 
Moraxella, Proteus, Providencia, Pseudomonas, Salmonella, Serratia, Shigella, 

10 Staphylococcus, Streptococcus, Treponema, Xanthomonas, Vibrio, and Yersinia. 

For therapeutic uses, the compositions or agents identified using the methods 
disclosed herein may be administered systemically, for example, formulated in a 
pharmaceutically-acceptable buffer such as physiological saline. Treatment may be 
accomplished directly, e.g., by treating the animal with antagonists which disrupt, 

15 suppress, attenuate, or neutralize the biological events associated with a pathogenicity 
polypeptide (e.g., a biofilm regulator polypeptide). Preferable routes of administration 
include, for example, subcutaneous, intravenous, interperitoneally, intramuscular, or 
intradermal injections which provide continuous, sustained levels of the drug in the 
patient. Treatment of human patients or other animals will be carried out using a 

20 therapeutically effective amount of an anti-pathogenic agent in a physiologically- 
acceptable carrier. Suitable carriers and their formulation are described, for example, in 
Remington's Pharmaceutical Sciences by E.W. Martin. The amount of the anti- 
pathogenic agent (e.g., an antibiotic) to be administered varies depending upon the 
manner of administration, the age and body weight of the patient, and with the type of 

25 disease and extensiveness of the disease. Generally, amounts will be in the range of 

those used for other agents used in the treatment of other microbial diseases, although in 
certain instances lower amounts will be needed because of the increased specificity of 
the compound. A compound is administered at a dosage that inhibits microbial 
proliferation (e.g., biofilm formation). If desired, such treatment is also performed in 

30 conjunction with standard antibiotic therapy. 
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Other Embodiments 
In general, the invention includes any nucleic acid sequence which may be 
isolated as described herein or which is readily isolated by homology screening or PCR 
amplification using the nucleic acid sequences of the invention. Also included in the 
5 invention are polypeptides which are modified in ways which do not abolish their 
pathogenic activity (assayed, for example as described herein). Such changes may 
include certain mutations, deletions, insertions, or post-translational modifications, or 
may involve the inclusion of any of the polypeptides of the invention as one component 
of a larger fusion protein. Also, included in the invention are polypeptides that have lost 
1 0 their pathogenicity. 

Thus, -in other embodiments, the invention includes any protein which is 
substantially identical to a polypeptide of the invention. Such homologs include other 
substantially pure naturally-occurring polypeptides as well as allelic variants; natural 
mutants; induced mutants; proteins encoded by DNA that hybridizes to any one of the 
1 5 nucleic acid sequences of the invention under high stringency conditions or, less 
preferably, under low stringency conditions (e.g., washing at 2X SSC at 40°C with a 
probe length of at least 40 nucleotides); and proteins specifically bound by antisera of 
the invention. 

The invention further includes analogs of any naturally-occurring polypeptide of 
20 the invention. Analogs can differ from the naturally-occurring the polypeptide of the 
invention by amino acid sequence differences, by post-translational modifications, or by 
both. Analogs of the invention will generally exhibit at least 85%, more preferably 90%, 
and most preferably 95% or even 99% identity with all or part of a naturally-occurring 
amino acid sequence of the invention. The length of sequence comparison is at least 15 
25 amino acid residues, preferably at least 25 amino acid residues, and more preferably 
more than 35 amino acid residues. Again, in an exemplary approach to determining the 
degree of identity, a BLAST program may be used, with a probability score between e" 3 
and e" 100 indicating a closely related sequence. Modifications include in vivo and in vitro 
' chemical derivatization of polypeptides, e.g., acetylation, carboxylation, 
30 phosphorylation, or glycosylation; such modifications may occur during polypeptide 



-62- 



WO 03/004689 PCT7US02/21431 

synthesis or processing or following treatment with isolated modifying enzymes. 
Analogs can also differ from the naturally-occurring polypeptides of the invention by 
alterations in primary sequence. These include genetic variants, both natural and 
induced (for example, resulting from random mutagenesis by irradiation or exposure to 
5 ethanemethylsulfate or by site-specific mutagenesis as described in Sambrook, Fritsch 
and Maniatis, Molecular Cloning: A Laboratory Manual (2d ed.), CSH Press, 1989, or 
Ausubel et al., supra). Also included are cyclized peptides, molecules, and analogs 
which contain residues other than L-amino acids, e.g., D-amino acids or non-naturally 
occurring or synthetic amino acids. 

10 In addition to full-length polypeptides, the invention also includes fragments of 

any one of the polypeptides of the invention. As used herein, the term "fragment," 
means at least 5, preferably at least 20 contiguous amino acids, preferably at least 30 
contiguous amino acids, more preferably at least 50 contiguous amino acids, and most 
preferably at least 60 to 80 or more contiguous amino acids. Fragments of the invention 

1 5 can be generated by methods known to those skilled in the art or may result from normal 
protein processing (e.g., removal of amino acids from the nascent polypeptide that are 
not required for biological activity or removal of amino acids by alternative mRNA 
splicing or alternative protein processing events). 

Furthermore, the invention includes nucleotide sequences that facilitate specific 

20 detection of any of the nucleic acid sequences of the invention. Thus, for example, 
nucleic acid sequences described herein or fragments thereof may be used as probes to 
hybridize to nucleotide sequences by standard hybridization techniques under 
conventional conditions. Sequences that hybridize to a nucleic acid sequence coding 
sequence or its complement are considered useful in the invention. Sequences that 

25 hybridize to a coding sequence of a nucleic acid sequence of the invention or its 

complement and that encode a polypeptide of the inyention are also considered useful in 
the invention. As used herein, the term "fragment," as applied to nucleic acid sequences, 
means at least 5 contiguous nucleotides, preferably at least 10 contiguous nucleotides, 
more preferably at least 20 to 30 contiguous nucleotides, and most preferably at least 40 
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to 80 or more contiguous nucleotides. Fragments of nucleic acid sequences can be 
generated by methods known to those skilled in the art. 

The invention further provides a method for inducing an immunological response 
in an individual, particularly a human, which includes inoculating the individual with, 
5 for example, any of the polypeptides (or a fragment or analog thereof or fusion protein) 
of the invention to produce an antibody and/or a T cell immune response to protect the 
individual from infection, especially bacterial infection (e.g., a Pseudomonas aeruginosa 
infection). The invention further includes a method of inducing an immunological 
response in ah individual which includes delivering to the individual a nucleic acid 

1 0 vector to direct the expression of a polypeptide described herein (or a fragment or fusion 
thereof) in order to induce an immunological response. 

The invention also includes vaccine compositions including the polypeptides or 
nucleic acid sequences of the invention. For example, the polypeptides of the invention 
may be used as an antigen for vaccination of a host to produce specific antibodies which 

1 5 protect against invasion of bacteria. The invention therefore includes a vaccine 

formulation which includes an immunogenic recombinant polypeptide of the invention 
together with a suitable carrier. 

The invention further provides compositions (e.g., nucleotide sequence probes), 
polypeptides, antibodies, and methods for the diagnosis of a pathogenic condition. 

20 All publications and references, including but not limited to patents and patent 

applications, cited in this specification are herein incorporated by reference in their 
entirety as if each individual publication or reference were specifically and individually 
indicated to be incorporated by reference herein as being fully set forth. Any patent 
application to which this application claims priority is also incorporated by reference 

25 herein in its entirety in the manner described above for publications and references. 

What is claimed is: 
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Claims 

1 . An isolated polypeptide comprising an amino acid sequence having at 
least 50% identity to the amino acid sequence of PvrR (SEQ ID NO:2), wherein 

5 expression of said polypeptide, in a microorganism, affects phenotype-mediated 
antibiotic-resistance in said microorganism. 

2. The isolated polypeptide of claim 1 , said polypeptide comprising the 
amino acid sequence of PvrR (SEQ ID NO:2). 

10 

3. The isolated polypeptide of claim 1, wherein said amino acid sequence 
consists essentially of the amino acid sequence of PvrR (SEQ ID NO:2) or a fragment 
thereof. 

15 4. An isolated polypeptide fragment of the isolated polypeptide of claim 1 . 

5 . The isolated polypeptide fragment of claim 4, wherein said polypeptide 
fragment comprises 200 contiguous amino acids of SEQ ID NO:2. 

20 6. An isolated polynucleotide having at least 50% identity to the nucleotide 

sequence of pvrR (SEQ ID NO:l), wherein expression of said polynucleotide, in a 
microorganism, affects phenotype-mediated antibiotic-resistance in said microorganism. 

7. The isolated polynucleotide of claim 6, said polynucleotide comprising 
25 the nucleotide sequence of pvrR (SEQ ID NO:l) or a complement thereof. 

8. The isolated polynucleotide of claim 7, said polynucleotide consisting 
essentially of the nucleotide sequence of pvrR (SEQ ID NO:l) or a fragment thereof. 
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9. A vector comprising the isolated polynucleotide of any one of claims 6, 

7, or 8. 

1 0. A host cell comprising the vector of claim 9. 

11. A screening method for identifying a compound that modulates gene 
expression of a regulator polynucleotide that affects phenotype-mediated antibiotic- 
resistance in a microorganism, said method comprising the steps of: 

(a) providing a microbial cell comprising a polynucleotide having at least 50% 
identity to the nucleotide sequence of pvrR (SEQ ID NO:l), wherein expression of said 
polynucleotide, in said microbial cell, affects phenotype-mediated antibiotic-resistance 
in said microbial cell; 

(b) contacting said microbial cell with a compound; and 

(c) comparing the level of gene expression of said polynucleotide in the 
presence of said compound with the level of gene expression in the absence of said 
compound; wherein a measurable difference in gene expression indicates that said 
compound modulates gene expression of a regulator polynucleotide that affects 
phenotype-mediated antibiotic-resistance in a microorganism. 

1 2. The method of claim 1 1 , wherein said screening method identifies a 
compound that increases transcription of said regulator polynucleotide. 

1 3 . The method of claim 1 1 , wherein said screening method identifies a 
compound that decreases transcription of said regulator polynucleotide. 

1 4. The method of claim 1 1 , wherein said screening method identifies a 
compound that increases translation of an mRNA transcribed from said regulator 
polynucleotide. 
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1 5 . The method of claim 1 1 , wherein said screening method identifies a 
compound that decreases translation of an mRNA transcribed from said regulator 
polynucleotide. 

5 16. The method of claim 1 1 , wherein the compound is a member of a 

chemical library. 

17. The method of claim 11, wherein said microbial cell belongs to the genus 
Pseudomonas, Vibrio, Salmonella, or Staphylococcus. 

10 

1 8. The method of claim 1 1 , wherein said microbial cell is a phenotypic 
variant having increased biofilm formation. 

1 9. The method of claim 18, wherein said phenotypic variant is a small 
15 colony variant. 

20. The method of claim 19, wherein said small colony variant is a small 
colony variant of Pseudomonas, Vibrio, Salmonella, or Staphylococcus. 

20 21 . The method of claim 18, wherein said small colony variant is a rough 

small colony variant. 

22. The method of claim 2 1 , wherein said rough small colony variant is 
Pseudomonas, Vibrio, or Salmonella. 

25 

23. The method of claim 1 1 , wherein the activity of the compound is 
dependent upon the presence of the pvrR gene (SEQ ID NO:l) or a functional equivalent 
thereof. 
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24. The method of claim 1 1 , wherein said compound targets the pvrR gene 
(SEQ ID NO:l) or a functional equivalent thereof. 

25. The method of claim 1 1 , wherein expression of said polynucleotide 
5 mediates phenotypic switching of said microbial cell in the presence of a high 

concentration of an antibiotic. 

26. The method of claim 1 1 , wherein said polypeptide is expressed by the 
isolated polynucleotide of any one of claims 6, 7, or 8. 

10 

27. A screening method for identifying a compound that modulates an 
activity of a polypeptide that affects phenotype-mediated antibiotic-resistance in a 
microorganism, said method comprising the steps of: 

(a) providing a microbial cell expressing a polypeptide having at least 50% 

1 5 identity to the amino acid sequence of PvrR (SEQ ID NO:2), wherein expression of said 
polypeptide, in said microbial cell, affects phenotype-mediated antibiotic-resistance in 
said microbial cell; 

(b) contacting said microbial cell with a compound; and 

(c) comparing an activity of said polypeptide in the presence of said compound 
20 with said activity in the absence of said compound; wherein a measurable difference in 

the activity indicates that said compound modulates said activity of said polypeptide that 
affects phenotype-mediated antibiotic-resistance in a microorganism. 

*> 

28. The method of claim 27, wherein said screening method identifies a 
25 compound that increases the activity of said polypeptide. 

29. The method of claim 27, wherein said screening method identifies a 
compound that decreases the activity of said polypeptide. 
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30. The method of claim 27, wherein the compound is a member of a 
chemical library. 

3 1 . The method of claim 27, wherein comparing the activity of the 
5 polypeptide involves an immunological assay. 

32. The method of claim 27, wherein said microbial cell belongs to the genus 
Pseudomonas, Vibrio, Salmonella, or Staphylococcus. 

10 33. The method of claim 27, wherein said microbial cell is a phenotypic 

variant having increased biofilm formation. 

34. The method of claim 33, wherein said phenotypic variant is Pseudomonas 
aeruginosa PA14 RSCV. 

15 

35. The method of claim 27, wherein said regulator polypeptide is the 
isolated polypeptide of claim 1 . 

36. The method of claim 27, wherein the activity of the polypeptide regulates 
20 phenotypic switching. 

37. The method of claim 27, wherein the activity of the polypeptide regulates 
biofilm-mediated antibiotic-resistance. 

25 38. The method of claim 27, wherein the activity of the polypeptide affects 

susceptibility of the microbial cell to antibiotic treatment. 

•39. The method of claim 27, wherein said polypeptide is an element of a two- 
component regulatory system. 

30 
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40. The method of claim 27, wherein the activity of the compound is 
dependent upon the presence of the PvrR polypeptide (SEQ ID NO: 2) or a functional 
equivalent thereof. 

5 41. The method of claim 27, wherein said compound targets the PvrR 

polypeptide (SEQ ID NO:2) or a functional equivalent thereof. 

42. The method of claim 27, wherein said polypeptide mediates phenotypic 
switching of said microbial cell in the presence of a high concentration of an antibiotic. 

10 

43. The method of claim 27, wherein said polypeptide is expressed by the 
isolated polynucleotide of any one of claims 6, 7, or 8. 

44. A. screening method for identifying a compound that modulates microbial 
15 biofilm formation, said method comprising the steps of: 

(a) culturing a microbial cell comprising a polypeptide having at least 50% 
identity to the amino acid sequence of PvrR (SEQ ID NO:2), wherein said microbial 
cell, upon culturing, forms a biofilm; 

(b) contacting said microbial cell with a compound; and 

20 (c) comparing microbial biofilm formation in the presence of said compound 

with microbial biofilm formation in the absence of said compound; wherein a 
measurable difference in said microbial biofilm formation indicates that said compound 
modulates biofilm formation. 



25 45. The method of claim 44, wherein said screening method identifies a 

compound that increases biofilm formation. 

46. The method of claim 44, wherein said screening method identifies a 
compound that decreases biofilm formation. 

30 
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47. The method of claim 44 3 wherein biofilm formation is measured by 
assaying microbial aggregation. 

48. The method of claim 47, wherein microbial aggregation is assayed using 
5 a microscope. 

49. The method of claim 47, wherein microbial aggregation is assayed using 
a salt aggregation test. 

1 0 50. The method of claim 47, wherein microbial aggregation is assayed using 

an attachment assay. 

5 1 . The method of claim 44, wherein the compound is a member of a 
chemical library. 

15 

52. The method of claim 44, wherein said microbial cell belongs to the genus 
Pseudomonas, Vibrio, Salmonella, or Staphylococcus. 

53. The method of claim 44, wherein said microbial cell is a phenotypic 
20 variant having increased biofilm formation. 

54. The method of claim 53, wherein said phenotypic variant is a small 
colony variant. 

25 55. The method of claim 54, wherein said small colony variant is a small 

colony variant of Pseudomonas, Vibrio, Salmonella, or Staphylococcus. 

56. The method of claim 54, wherein said small colony variant is a rough 
small colony variant. 

30 
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57. The method of claim 56, wherein said rough small colony variant is 
Pseudomonas, Vibrio, or Salmonella. 

58. The method of claim 44, wherein the activity of the compound is 

5 dependent upon the presence of PvrR polypeptide (SEQ ID NO: 2) or a functional 
equivalent thereof. 

59. The method of claim 44, wherein said compound targets the PvrR 
polypeptide (SEQ ID NO:2) or a functional equivalent thereof. 

10 

60. The method of claim 44, wherein expression of said polypeptide mediates 
phenotypic switching of said microbial cell in the presence of a high concentration of an 
antibiotic. 

15 61. The method of claim 44, wherein said polypeptide is an isolated 

polypeptide of any one of claims 1, 2, or 3. 

62. A method of treating a microbial infection involving a microorganism 
that forms a biofilm in a mammal, said method comprising administering to said 

20 mammal a therapeutically-effective amount of a compound that induces the expression 
of or activity of or represses the expression of or activity of the polypeptide of any one 
of claims 1, 2, or 3. 

63 . The method of claim 62, wherein said method further comprises 
25 administering to said mammal a therapeutically-effective amount of an antibiotic. 

64. The method of claim 62, wherein said mammal is a human. 

65. The method of claim 62, wherein said human has cystic fibrosis. 

30 
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66. The method of claim 62, wherein said human has a chronic infection. 

67. The method of claim 62, wherein the said microorganism belongs to the 
genus Pseudomonas, Vibrio, Salmonella or Staphylococcus. 

5 

68. A method of cleaning or disinfecting a surface at least partially covered 
by a microorganism that forms a biofilm, said method comprising contacting said 
microorganism with a cleaning composition comprising a compound that induces the 
expression of or activity of or represses the expression of or activity of the polypeptide 

10 of claim 1,2, or 3. 

69. The method of claim 68, wherein said microorganism belongs to the 
genera Pseudomonas, Vibrio, Salmonella or Staphylococcus. 

15 70. A screening method for identifying a compound that decreases 

pathogenicity of an antibiotic-resistant phenotypic variant, said method comprising the 
steps of: 

(a) contacting an antibiotic-resistant phenotypic variant with a candidate 
compound; and 

20 (b) measuring reversion of said antibiotic-resistant phenotypic variant to a wild- 

type phenotype, an increase in reversion indicating that said compound decreases 
pathogenicity of said antibiotic-resistant phenotypic variant. 

7 1 . Thp method of claim 70, wherein said antibiotic-resistant phenotypic 
25 variant is a bacterial variant. 

72. The method of claim 71 , wherein said antibiotic-resistant phenotypic 
bacterial variant is cultured in the absence of an antibiotic. 
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73 . The method of claim 7 1 , wherein said antibiotic-resistant phenotypic 
bacterial variant has increased biofilm formation. 

74. The method of claim 71 , wherein said antibiotic-resistant phenotypic 
5 bacterial variant is a rough small colony variant. 

75. The method of claim 71 , wherein said antibiotic-resistant phenotypic 
bacterial variant is a hyperpiliated variant. 

10 76. The method of claim 7 1 , wherein said antibiotic-resistant phenotypic 

bacterial variant has increased hydrophobicity . 

77 . The method of claim 7 1 , wherein said antibiotic-resistant phenotypic 
bacterial variant has an alteration in a surface component. 

15 

78. The method of claim 71, wherein said antibiotic-resistant phenotypic 
bacterial variant is a pathogen. 

79. The method of claim 78, wherein said pathogen is a Gram positive 
20 bacterium. 

80. The method of claim 79, wherein said pathogen is Staphylococcus. 

8 1 . The method of claim 78, wherein said pathogen is a Gram negative 
25 bacterium. 

82. The method of claim 75, wherein said pathogen is Vibrio, Pseudomonas, 
or Salmonella. 
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83. A screening method for identifying a compound that decreases 
pathogenicity of a wild-type microbe, said method comprising the steps of: 

. (a) culturing a wild-type microbe with a candidate compound in the presence of 
an antibiotic; and 

5 (b) comparing the number of antibiotic-resistant phenotypic variants in the 

presence of said compound to the number of antibiotic-resistant phenotypic variants in 
the absence of said compound, a decrease in the number of said antibiotic-resistant 
phenotypic variants in the presence of said compound indicating that said compound 
decreases pathogenicity of said wild-type microbe. 

10 

84. A screening method for identifying a polynucleotide encoding a regulator 
polypeptide that modulates an antibiotic-resistant phenotype of a microorganism, said 
method comprising the steps of: 

(a) identifying an antibiotic-resistant phenotypic variant of a microorganism 
1 5 comprising a first phenotype; 

(b) mutagenizing said antibiotic-resistant phenotypic variant of said 
microorganism, thereby generating a mutated phenotypic variant of said microorganism; 
and 

(c) selecting said mutated phenotypic variant of step (b) having a second 

20 phenotype, other than the first phenotype of said antibiotic-resistant phenotypic variant, 
wherein said second phenotype identifies a mutation in said mutated phenotypic variant 
of step (b); and 

(d) using said mutation for identifying a polynucleotide encoding a regulator 
polypeptide that modulates an antibiotic-resistant phenotype of a microorganism. 

25 

85. The method of claim 84, wherein said second phenotype comprises a 
wild-type phenotype. 
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86. A screening method for identifying a polynucleotide encoding a regulator 
polypeptide that modulates phenotype-mediated antibiotic-resistance of a 
microorganism, said method comprising the steps of: 

(a) transforming an antibiotic-resistant phenotypic variant of a microorganism 
5 with a candidate polynucleotide encoding a regulator polypeptide; and 

(b) culturing said transformed antibiotic-resistant phenotypic variant of a 
microorganism under conditions suitable for expression of said regulator polypeptide; 
and 

(c) measuring reversion of said transformed antibiotic-resistant phenotypic 
1 0 variant of said microorganism to a wild-type phenotype, an increase in reversion 

identifies said polynucleotide as encoding a regulator polypeptide that modulates 
phenotype-mediated antibiotic-resistance. 

87. The method of claim 80, wherein said polynucleotide encodes a regulator 
1 5 polypeptide that modulates a phenotypic switch from antibiotic-resistant phenotype to an 

antibiotic-susceptible phenotype. 

88. The method of claim 80, wherein said polynucleotide having at least 50% 
identity to the nucleotide sequence of pvrR (SEQ ID NO:l) encodes an element of a 

20 two-component regulatory system. 



-76- 



WO 03/004689 



PCT/US02/21431 




Figures 2 A*B 
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Figure 5A, pvrR (SEQ ID NO:l) 

atgagctgga aatcctatcg ggtgctggtg gtcgaagatc agccgtttca gcgcgaatac 60 
ctgctcaacc tgtttcgcga gcgcggcgtg cagtacctgg taggtgccgg cgacggcgcg 120 
qaggcgttgc gctgcctgaa gcaggacagg ttcgacctga tcctcagcga tctgatgatg 180 
ccgggcatgg atggtatcca aatgatcctg caactgccgt atctcaagca tcgtccgaag 240 
ctggcgctga tgagctcctc gtcgcagcgg atgatgctca gtgccagccg ggtcgcccag 300 
agtctcggct tgtcggtaat cgacctgttg cccaagccga ctctgcccaa ggccatcggc 360 
caacttctgg aacacctgga aagatgcctc aggcagaagc tggagccgga aaccgacgag 420 
actccgcatg ggcgcacggc gttgctggat gccctgcata acgagcaact ggtgacctgg 480 
ttccaggcta agaaatccct ccacaccggg cgcatagtcg gcgccgaggc gttgatacgc 540 
tggagccacc cgcagcatgg cctgttgctg cccagctgtt tcatgagtga tgtcgacgct 600 
accggtctgc acgaggcgtt gctctggcgc gtgctcgaac agaccctgaa cgcccaggaa 660 
tcgtggcgca gggcgggtta cgagattccg gtttcggtga atctgccgcc gcacctgctc 720 
gataaccagg aacttccgga tcgactctat gagtacgtcg gcgctcgcgg ggcttgtacc 780 
agctcactat gtttcgagtt gaccgagagc agtgtcacaa ctctgtcaag taactactat 840 
qcaggtgcct gtcgcttgcg catgaaaggg ttcggattgg cccaggacga ctttggccag 900 
qgttacagct cgttctataa cctggtcacg acgcctttca cggagctgaa gatcgaccgc 960 
tccctagtcc agggatgcgt agaggataac ggcctcaatg cagctgtcat cagttgtatt . 1020 
qagttgggtc accgcctgaa tctcgacgtg gtggccgaag gcgtggagac ctgcgaggaa 1080 
ctgaatcttc ttcgtcgtct tggctgcgac cgggcgcagg gtttcctgat ttctaaggca 1140 
gtgtctgctc gtgagttcga gcggcagtta agggaggacg gccccagcct ccttgtttaa 1200 
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ORF1-12 
SEQ ID NO; 3 

atgagccatg aaatccgcac acctctgtac 
cgtaccgagc tgagtcggca gcaggccggt 
accctgctgc aactgatcag cgatgtgctt 
gacctagagt gcgtggaatt ctccccgttg 
accggtgccg cgcaggccaa ggggctgcag 
ctgcgcatgc ggggggccgc ggcgtcgatc 
gcggtgaagt tcaccgacaa tggctatgtc 
gccgaatgtg tgatgctgac ctggcaggtc 
gatcagccgc gtctgttcga accgttctac 
ggcacgggcc tcggcttgtc gatcagccag 
aaactggtca gtgagctggg gttgggcagc 
atcgcgatgc aggctgagcc gcaggaccta 
gtccgcgacc taacggaatg cctgtgtggc 
gtcgcgacgc cgaggtcgct ggacgaggcg 
ctgctggagg gggcgccgat gttcgaagca 
cagggtgata tggagccgca ggcacagggc 
gacggcctgc atcgtgctct gggcctggcc 
ccgatacggc tggctccgtt gcgcaatcta 
gcgatcaacc agttgatctt gagggaccag 
ctcttcgatg gtcgcgaggc gttgctgcac 
accgatatca acatgccgaa catgaacgga 
gggttccggc agccgatcat cggcgcgacg 
.tgcatgtccg ccgggatgaa cgattgcctg 
aactgcttga ttaatattct caaggtggat 
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ggcatgcttg gcacgcttga gctgcttggg 60 
tacctaaagg caatccagca ttcctcgtcg 120 
gacgtatcca agatagaggc cggccaactg 180 
gaattgaccg aagaggtcgt gcagtcgttc 24 0 
ttgtatacct gcctctctgc ggagctgccg 3 00 
cggcagattc tcaacaacct gctgagcaac 360 
aacgtccacc tgaaggccag cgtggtcgat 420 
aacgataccg gcatggggat caacgtcgag 480 
cagatacgcc gctccgagca tccggtcgca 540 
cgcctggcgc agctaatgaa tggcagtctg 600 
agctttagcc tcaggcttcc gcttgagcgg 660 
gccgggtgcg ccgtccaagt gctggcgcct 720 
tggatctccc gctggggtgg aagggccatg 78 0 
gacgcgacct cgctgctggt cgaagtgtta 84 0 
tggccaggat gccgggtgga gctttcccct 900 
cgcgactggc tgctcgggct caacaacctg 960 
catgggcgtc tcgctgatcc ttcgacgccg 1020 
ggtctccgcg tcctagtggt ggaggataac 1080 
atggaagcgc tgggctgcag cgtggagctg 1140 
tgccagacgg cctgcttcga cgtggtgctc 1200 
tacgagctaa ccgcggagct' acggcgccaa 1260 
gcgaacgcca tgcgtgagga gcgcgagcgc 1320 
gtcaaaccgg tggatctgaa tgcccttcag 13 80 
cgatga 1416 
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Figure 5C ORF3 (SEQ ID NO: 5) 

atgatggatg ttatacggga gcatgaggta tttcttgggc gcatcgctcg aaaaagcgac 60 
aagaccaccc agaagtacga ctatgacgtg gtgcctttgc agcggcactt gttggcaaag 120 
gaaaacggat tagcggtcta tgagggacgg gagttttcct ttgctatgcc atttctactg 180 
gctaccaagc acgcgttgag cgccgattcc tcgggagatc cgttttcgct cggtgtattg 240 
ctcgccaatt tctacggaag cttctggagt gtttccgcct atcccgcgcc acagttactg 300 
atctttgatc tttccggcag cacccgcctg gcagtgccgt cgattccctc cacagcgcag 360 
cgtgacaggt tgagcggaag ctatccgatg atagtcgagc gcattctggc gcgcttgcgc 420 
acccggccgg tgggggagga cgctcagcgt gtccattgga tacgcgctga tcgctatcgc 480 
gactcggcgc tggagatgtt gggagtcgcc cgggttgatc tgccggaaac actctggtgg 540 
cacgacgagc cgaaccatct gatcatcgct gcgagcctgc ttgatctcag gcgaatcaat 600 
gacttcgaac agttggttga gcgcccggca ttcgattcgt acagcctggt atcgccggat 660 
ggcgaggtat tgctcggcgc ggcccctgcg accggcctga gggatggcct gaacctcacc 720 
cgacaggggg tcgccgttca actgcgcagc cagcctgaga acggctggct cgcggtctac 780 
cgaaccgact acggcaattt ctttcgccac tcccggtggc tggtggcagg tctgctgctg 840 
accccggcgc tgctcctggc cggttggctc gggatgcgtt ggtacaccag cagcgtcgtc 900 
aacccggtgc atcgggcgca ccggcaactg gtggagagcg acaccttcag ccggacgctg 960 
atacagaccg cgccggtggc tctggtggtg ctgacccagg atgaccagca actggtgacc 1020 
tgcaaccact tggccgccca gtggctgggc gggcccacgg agatccttgg gctgacttcc 1080 
aactggaagc ttttcgatgc gcgtgggcag gtaccaggag acatctgtat ccaggtcggt 1140 
gggcgctatt tgcagaccgc cttcgcggcg acccgctatg ccggcaccga ggcggtactg 1200 
tgcgtattca acgacatcac ggtccactgc gaggcggaga ccgcgctgtc caatgcgaag 1260 
cgagcagcgg atgccgccag ccaggccaag accctgttcc tggcccgcat gagccatgaa 1320 
atccgtactc ccctgtacgg tgtccttggc accctggagt tgctcgacct gaccaccctg 1380 
aacgagcggc aacgcgccta cctacgcacc atccagagtt cgtctgcgac gctcatgcaa 1440 
ctgattagcg atgtgctgga tgtctcgaag atcgaagcgg ggcagatggc tctgaccctg 1500 
gccgccttca atccgctgga cctagtgcgg gaagtgcttg gcaactttgc cgccagcgcc 1560 
atggccaagg acctgcaggt agacccgctc gatactcttg cgcttgaggc gcaggtcgcg 1620 
catggcttcg aagaaagcgt tctgttcgag gttgctggtg gctcggtcgg ccatttcgaa 1680 
gagggtgtcg tcggcgttgt cgaacaacgc ctgcaacgcc tgtttcagct gcagcgccgc 1740 
cttgtcgcgc acctgcacga ggatgaccgg caggcgcccc gctccggcgt tcggcgacgg 1800 
ctcggaagcg accctggtca ggtgcaccac attggcatcg ttctgcatcg ggactctcct 1860 
gccafccctcg cggccgcgca tggaatggca aaaatcgggc acagaggatc gattggcgtc 1920 
gtccgtaacg tcaatttcca ggcgtcaaaa acaagtatct acattcatta tagagatact 1980 
ttcaaatcta gatag 1995 
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Figure 5D PvrR (SEQ ID NO: 2) 

t 

Met Ser Trp Lys Ser Tyr Arg Val Leu Val Val Glu Asp Gin Pro Phe 

1 5 10 15 

Gin Arg Glu Tyr Leu Leu Asn Leu Phe Arg Glu Arg Gly Val Gin Tyr 

20 25 30 

Leu Val Gly Ala Gly Asp Gly Ala Glu Ala Leu Arg Cys Leu Lys Gin 

35 40 45 

Asp Arg Phe Asp Leu lie Leu Ser Asp Leu Met Met Pro Gly Met Asp 

50 * 55 60 

Gly He Gin Met He Leu Gin Leu Pro Tyr Leu Lys His Arg Pro Lys 
65 70 75 80 

Leu Ala Leu Met Ser Ser Ser Ser Gin Arg Met Met Leu Ser Ala Ser 

85 90 95 

Arg Val Ala Gin Ser Leu Gly Leu Ser Val He Asp Leu Leu Pro Lys 

100 105 HO 

Pro Thr Leu Pro Lys Ala He Gly Gin Leu Leu Glu His Leu Glu Arg 

115 120 125 

Cys Leu Arg Gin Lys Leu Glu Pro Glu Thr Asp Glu Thr Pro His Gly 

130 140 
Arg Thr Ala Leu Leu Asp Ala Leu His Asn Glu Gin Leu Val Thr Trp 
145 150 155 160 

Phe Gin Ala Lys Lys Ser Leu His Thr Gly Arg He Val Gly Ala Glu 

165 170 175 

Ala Leu He Arg Trp Ser His Pro Gin His Gly Leu Leu Leu Pro Ser 

180 185 190 

Cys Phe Met Ser Asp Val Asp Ala Thr Gly Leu His Glu Ala Leu Leu 

195 200 205 

Trp Arg Val Leu Glu Gin Thr Leu Asn Ala Gin Glu Ser Trp Arg Arg 

210 215 220 • 

Ala Gly' Tyr Glu He Pro Val Ser Val Asn Leu Pro Pro His Leu Leu 

225 230 235 240 

Asp Asn Gin Glu Leu Pro Asp Arg Leu Tyr Glu Tyr Val Gly Ala Arg 

245 250 255 

Gly Ala Cys Thr Ser Ser Leu Cys Phe Glu Leu Thr Glu Ser Ser Val 

260 265 270 

Thr Thr Leu Ser Ser Asn Tyr Tyr Ala Gly Ala Cys Arg Leu Arg Met ■ 

275 280 285 

Lys Gly Phe Gly Leu Ala Gin Asp Asp Phe Gly Gin Gly Tyr Ser Ser 

290 295 300 

Phe Tyr Asn Leu Val Thr Thr Pro Phe Thr Glu Leu Lys He Asp Arg 
305 310 315 320 

Ser Leu Val Gin Gly Cys Val Glu Asp Asn Gly Leu Asn Ala Ala Val 

325 " 330 335 

He Ser Cys lie Glu Leu Gly His Arg Leu Asn Leu Asp Val Val Ala 

340 345 350 

Glu Gly Val Glu Thr Cys Glu Glu Leu Asn Leu Leu Arg Arg Leu Gly 

355 360 365 

Cys Asp Arg Ala Gin Gly Phe Leu He Ser Lys Ala Val Ser Ala Arg 

370 375 380 

Glu Phe Glu Arg Gin Leu Arg Glu Asp Gly Pro Ser Leu Leu Val 
385 390 395 
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ORF1-12 








SEQ 


ID NO:4 






Met 


ber 


His 


Glu 


He 


1 








5 


Glu 


Leu 


Leu Gly Arg 








20 




Lys 


TV 1 - 

Ala 


He 


Gin 


His 




35 






Val 


Leu 


Asp 


Val 


Ser 




50 








Val 


Glu 


Phe 


Ser 


Pro 


65 










Thr 


Gly 


Ala Ala 


Gin 








85 


Ala 


GlU 


Leu 


Pro 


Leu 








100 




He 


Leu 


Asn 


Asn 


Leu 






115 






Tyr 


Val 


Asn 


val 


His 


130 








Met 


Leu 


Thr Trp Gin 


145 










Asp 


Gin 


Pro Arg 


Leu 








165' 


His 


Pro 


Val 


Ala 


Gly 








180 




Ala 


Gin 


Leu 


Met 


Asn 






195 






Gly 


Ser 


Ser 


Phe 


Ser 


210 








Ala 


Glu 


Pro Gin Asp 


225 










Val 


Arg 


Asp 


Leu 


Thr 








245 


Gly 


Arg 


Ala 


Met 


Val 








260 




Thr 


Ser 


Leu 


Leu 


Val 






275 






Glu 


Ala 


Trp Pro Gly 




290 








Glu 


Pro 


Gin 


Ala 


Gin 


305 










Asp 


Gly 


Leu 


His 


Arg 










325 


Pro 


Ser 


Thr 


Pro 


Pro 








340 




Arg 


val 


Leu 


Val 


Val 




355 






Asp 


Gin 


Met 


Glu 


Ala 




370 








Arg 


Glu 


Ala 


Leu 


Leu 


385 










Thr 


Asp 


He 


Asn 


Met 










405 


Leu 


Arg 


Arg Gin Gly 








420 




Ala 


Met 


Arg 


Glu 


Glu 






435 






Cys 


Leu 


Val 


Lys 


Pro 



Figure 5E 



Arg Thr 


Pro Leu 


Tyr Gly 






10 




Thr Glu 


Leu Ser 


Arg 


Gin 




25 






Ser Ser 


Ser Thr 


Leu 


Leu 




40 






Lys He 


.Glu Ala 


Gly Gin 


55 








Leu Glu 


Leu Thr 


Glu 


Glu 


70 






75 


Ala Lys 


Gly Leu 


Gin 


Leu 






90 




Ara Met 


Ara Glv 


Ala 


Ala 




105 






Leu Ser 


Asn Ala 


Val Lys 










Leu Lvs 


Ala Ser 


Val 


Val 










vol noii 


Asp Thr 


Gly Met 


•LDU 






155 


XT I JC VJJ. u 


rxu rue 


Tyr 


Gin 






170 




inx v?j.y 


ucu uxy 


Leu 


Ser 




loo 






fil v .Cot* 


UCU XJ jr p 


Leu 


Val 




A V u 






Leu Arg 


T.P11 PT"0 


Leu 


Glu 


Z ID 








Ucu Aid 




Ala 


Val 


£. J V 






235 


Glu Cys 


jjcu \.yb 


Gly Trp 






250 




M-L O XXIX 


"Dim Avri 
JcXU niy 


Ser 


Leu 










Glu Val 


Tien Leu 


Leu 


Glu 




*5 R 0 
Z o U 






a ni. y 


Val Glu 


Leu 


Ser 










wxy /*x y 


7\en r P->— o 

nop 


Leu 


Leu 


•lift 






315 


Ala Leu 


uxy xjcu 


Ala 


His 






330 




Tie Ara 
lie /*x y 


T«pu Al a 


Pro 


Leu 








Glu Asn 


Asn Ala 


He 


Asn 




360 






Leu Gly 


Cys Ser 


Val 


Glu 


.375 








His Cys 


Gin Thr 


Ala 


Cys 


390 






395 


Pro Asn 


Met Asn 


Gly Tyr 






410 




Phe Arg 


Gin Pro 


He 


He 




425 






Arg Glu 


Arg Cys 


Met 


Ser 




440 






Val Asp 


Leu Asn 


Ala 


Leu 



Met Leu Gly Thr Leu 
15 

Gin Ala Gly Tyr Leu 
30 

Gin Leu He Ser Asp 
45 

Leu Asp Leu Glu Cys 
60 

Val Val Gin Ser Phe 
80 

Tyr Thr Cys Leu Ser 
95 

Ala Ser He Arg Gin 
110 

Phe Thr Asp Asn Gly 
125 

Asp Ala Glu Cys Val 
140 

Gly He Asn Val Glu 
160 

He Arg Arg Ser Glu 
175 

He Ser Gin Arg Leu 
190 

Ser Glu Leu Gly Leu 
205 

Arg He Ala Met Gin 
220 

Gin Val Leu Ala Pro 
240 

He Ser Arg Trp Gly 
255 

Asp Glu Ala Asp Ala 
270 

Gly Ala Pro Met Phe 
285 

Pro Gin Gly Asp Met 
300 

Gly Leu Asn Asn Leu 
320 

Gly Arg Leu Ala Asp 
335 

Arg Asn Leu Gly Leu 
350 

Gin Leu He Leu Arg 
365 

Leu Leu Phe Asp Gly 
380 

Phe Asp Val Val Leu 
400 

Glu Leu Thr Ala Glu 
415 

Gly Ala Thr Ala Asn 
430 

Ala Gly Met Asn Asp 
445 

Gin Asn Cys Leu lie 
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450 1 455 

Asn He Leu Lys Val Asp Arg 
465 470 



460 
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Figure 5F, ORF3 (SEQ ID NO: 6) 

Met Met Asp Val He Arg Glu His Glu Val Phe Leu Gly Arg He Ala 

1 5 10 15 

Arg Lys Ser Asp Lys Thr Thr Gin Lys Tyr Asp Tyr Asp Val Val Pro 

•* 20 25 30 

Leu Gin Arg His Leu Leu Ala Lys Glu Asn Gly Leu Ala Val Tyr Glu 

35 40 45 

Gly Arg Glu Phe Ser Phe Ala Met Pro Phe Leu Leu Ala Thr Lys His 

50 .55 60 

Ala Leu Ser Ala Asp Ser Ser Gly Asp Pro Phe Ser Leu Gly Val Leu 
65 70 75 .80 

Leu Ala Ash Phe Tyr Gly Ser Phe Trp Ser Val Ser Ala Tyr Pro Ala 

85 90 95 

Pro Gin Leu Leu He Phe Asp Leu Ser Gly Ser Thr Arg Leu Ala Val 

100 . 105 110 

Pro Ser He Pro Ser Thr Ala Gin Arg Asp Arg Leu Ser Gly Ser Tyr 

115 120 125 

Pro Met He Val Glu Arg He Leu Ala Arg Leu Arg Thr Arg Pro Val 

130 135 140 

Gly Glu Asp Ala Gin Arg Val His Trp He Arg Ala Asp Arg Tyr Arg 
145 150 155 160 

Asp Ser Ala Leu Glu Met Leu Gly Val Ala Arg Val Asp Leu Pro Glu 

165 170 175 

Thr Leu Trp Trp His Asp Glu Pro Asn His Leu He He Ala Ala Ser 

180 185 190 

Leu Leu Asp Leu Arg Arg lie Asn Asp Phe Glu Gin Leu Val Glu Arg 

195 200 205 

Pro Ala Phe Asp Ser Tyr Ser Leu Val Ser Pro Asp Gly Glu Val Leu 

210 215 220 

Leu Gly Ala Ala Pro Ala Thr Gly Leu Arg Asp Gly Leu Asn Leu Thr 
225 230 235 240 

Arg Gin Gly Val Ala Val Gin Leu Arg Ser Gin Pro Glu Asn Gly Trp 

245 250 255 

Leu Ala Val Tyr Arg Thr Asp Tyr Gly Asn Phe Phe Arg His Ser Arg 

260 265 270 

Trp Leu Val Ala Gly Leu Leu Leu Thr Pro Ala Leu Leu Leu Ala Gly 

275 280 285 

Trp Leu Gly Met Arg Trp Tyr Thr Ser Ser Val Val Asn Pro Val His 

290 295 300 

Arg Ala His Arg Gin Leu Val Glu Ser Asp Thr Phe Ser Arg Thr Leu 
305 310 315 320 

He Gin Thr Ala Pro Val Ala Leu Val Val Leu Thr Gin Asp Asp Gin 

325 330 335 

Gin Leu Val Thr Cys Asn His Leu Ala Ala Gin Trp Leu Gly Gly Pro 

340 345 350 

Thr Glu He Leu Gly Leu Thr Ser Asn Trp Lys Leu Phe Asp Ala Arg 

355 3'60 365 

Gly Gin Val Pro Gly Asp He Cys He Gin Val Gly Gly Arg Tyr Leu 

370 375 380 

Gin Thr Ala Phe Ala Ala Thr Arg Tyr Ala Gly Thr Glu Ala Val Leu 
385 390 395 400 

Cys Val Phe Asn Asp He Thr Val His Cys Glu Ala Glu Thr Ala Leu 

405 410 415 

Ser Asn Ala Lys Arg Ala Ala Asp Ala Ala Ser Gin Ala Lys Thr Leu 

420 425 430 

Phe Leu Ala Arg Met Ser His Glu He Arg Thr Pro Leu Tyr Gly Val 
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Figure 5F Continued 



435 

Leu Gly Thr 
450 



Leu Glu 



440 

Leu Leu Asp 
455 



445 

Leu Thr Thr Leu Asn Glu Arg Gin 
460 



Arg Ala Tyr 
465 



Leu Arg 



Thr lie Gin 
470 



Ser Ser Ser Ala Thr Leu Met Gin 
475 4B0 



Leu He Ser Asp Val Leu Asp Val Ser Lys He Glu Ala Gly Gin Met 

485 490 495 

Ala Leu Thr Leu Ala Ala Phe Asn Pro Leu Asp Leu Val Arg Glu Val 

500 505 510 

Leu Gly Asn Phe Ala Ala Ser Ala Met Ala Lys Asp Leu Gin Val Asp 

515 520 . 525 

Pro Leu Asp Thr Leu Ala Leu Glu Ala Gin Val Ala His Gly Phe Glu 

5 30 535 540 

Glu Ser Val Leu Phe Glu Val Ala Gly Gly Ser Val Gly His Phe Glu 
545 550 555 560 

Glu Gly -Val Val Gly Val Val Glu Gin Arg Leu Gin Arg Leu Phe Gin 

565 570 575 

Leu Gin Arg Arg Leu Val Ala His Leu His Glu Asp Asp Arg Gin Ala 

580 585 590 

Pro Arg Ser Gly Val Arg Arg Arg Leu Gly Ser Asp Pro Gly Gin Val 

595 " 600 605 

His His He Gly He Val Leu His Arg Abp Ser Pro Ala Thr Leu Ala 

610 * 615 620 

Ala Ala His Gly Met Ala Lys He Gly His Arg Gly Ser He Gly Val 



625 

Val Arg Asn 



Val Asn 
645 



630 

Phe Gin Ala 



635 640 
Ser Lys Thr Ser He Tyr He His 
650 655 



Tyr Arg Asp 



Thr Phe 
660 



Lys Ser Arg 



SPA, 
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Figure 5Q 

SEQ ID NO: 7 

cttcctcacc gcactttcat tcaccgcacc gttatcggcg tcatggacaa tgccctgcag 60 
tcctgagacc tcattaggaa tcatctgata gagggaattt ttccatgccg tcagtctttc 120 
ggcafSaagt tgaagaattt cttacagcct tttgatagcg gtttctccac tccgagtgct 180 
gcgctcaagc tgctccgcgfc g ctcggtggc gccttgaggt tgtgcgtgct atgcagcctg 240 
atattcagtg tgagc£fe§gt tttaaaccat caggtgtccc tcagtcggca agctafcgaat 300 
gtggctlf^t acgaagcgca gctttatttc gagcagcgcg aggcgttgct caatcacttg 360 
agcggcaatg tcgtgccctt ggccgcgggt agagcgctcg tcaacgaagc gccgaacaat 420 
gtgagcatcc tgccgttgag tgacggaggg cgaggtctgc tattgaccgc tcgcacgctc 480 
ggtgatctcc gggaaaagcg gctggcactg f|gtatctgg tcgataccga caaaggccct 540 
ctggtttacc ggcttaccgc cgatggtagg ccctcggcag cgatatccag cacgataacc 600 
aaagaggtgt accgagcctt gctggcgact ccgtcggcgc ctgttcactg ggtgactgac 660 
ggtggtaccc ctcaacggct gtaccttttt gaatccttag gcgatgagcc gggcgagggg 720 
tggctaggcc tggagattct cggcgaagac ctcgattcga fcjgjttgcgccg gaatgatgcc 780 
ggaaactacg fejctgctgga tcagcatggg caggtcgtac tcgctacgga cgcagaggcg 84 0 
ctggggagcg gtgcgtcgcg_ gacgcttttg cgtggagacg gcttcggttt catcggtgct 900 
ggcccactgc cgcagcatj&E ggtgcttttc cagcacgtgg ggtcttcgag ctgggatctg 960 
atctatcaca tcggtatcgg tcgcctgttg ctggctctgt ggctccctct gttacttgcc 1020 
tctgcgttgg cactcgcagt cggcatccta ctgcattggc tggtgcggag catcgagcga 1080 
cgcttgatag agcccgcaaa gcgacgcctt gaagcattga aggagagcga agccttttcc 1140 
cgtgcagtta tccaggccgc gcccgtcgcg ctgtgcgtgc tgcgtcgtgc cgacgccgca 1200 
gtggtcctgg aaaatcccca ggcgcgccaa tggctgggtg atagcgaggc gattgcccac 1260 
gacgcgccga gatggatttc ccaggcgttc gcaggaggtg tgaagtgttc tggagaagaa 1320 
ctggaaaccg aggcagggct acatcttcat ctcaattaca cgcccacccg ctataacggt 13 80 
gaagacgtat tgttctgcgc cttcagtgaa atcagtgcac gcaagcggati gjgaggcggaa 1440 
ctggctcgcg caaaatccct ggcggatgct gccaatgaag ccaagacgct gtttctcgcc 1500 
accgSgagcc atgaaatccg cacacctctg tacggcatgc ttggcacgct tgagctgctt 1560 
gggcgtaccg agctgagtcg gcagcaggcc ggttacctaa aggcaatcca gcattcctcg 1620 
tcgaccctgc tgcaactgat cagcgatgtg cttgacgtat ccaagataga ggccggccaa 1680 
ctggacctag agtgcgtgga attctccccg ctggaattga ccgaagaggt cgtgcagtcg 1740 
ttcaccggtg ccgcgcaggc caaggggctg cagttgtata cctgcctctc tgcggagctg 1800 
ccgctgcgca tgcggggggc cgcggcgtcg atccggcaga ttctcaacaa cctgctgagc 1860 
aacgcggtga agttcaccga caatggctat gtcaacgtcc acctgaaggc cagcgtggtc 1920 
gatgccgaat gtgtgatgct gacctggcag gtcaacgata ccggcatggg gatcaacgtc 1980 
gaggatcagc cgcgtctgtt cgaaccgttc taccagatac gccgctccga gcatccggtc 2040 
gcaggcacgg gcctcggctt gtcgatcagc cagcgcctgg cgcagctaat gaatggcagt 2100 
ctgaaactgg tcagtgagct ggggttgggc agcagcttta gcctcaggct tccgcttgag 2160 
cggatcgcga tgcaggctga gccgcaggac ctagccgggt gcgccgtcca agtgctggcg 2220 
cctgtccgcg acctaacgga atgcctgtgt ggctggatct cccgctgggg tggaagggcc 2280 
atggtcgcga cgccgaggtc gctggacgag gcggacgcga cctcgctgct ggtcgaagtg 2340 
ttactgctgg agggggcgcc gatgttcgaa gcatggccag -gatgccgggt ggagctttcc 2400 
cctcagggtg atatggagcc gcaggcacag ggccgcgact ggctgctcgg gctcaacaac 2460 
ctggacggcc tgcatcgtgc tctgggcctg gcccatgggc gtctcgctga tccttcgacg 2520 
ccgccgatac ggctggctcc gttgcgcaat ctaggtctcc gcgtcctagt ggtggaggat 2580 
aacgcgatca a.ccagttgat cttgagggac cagatggaag cgctgggctg cagcgtggag 2640 
ctgctcttcg atggtcgcga ggcgttgctg cactgccaga cggcctgctt cgacgtggtg 2700 
ctcaccgata tcaacatgcc gaacatgaac ggatacgagc taaccgcgga gc'tacggcgc 2760 
caagggttcc ggcagccgat catcggcgcg acggcgaacg ccatgcgtga ggagcgcgag 2 820 
cgctgcatgt ccgccgggat gaacgattgc ctggtcaaac cggtggatct gaatgccctt 2880 
cagaactgct tgattaatat tctcaaggtg gatcg^a«g- ctggaaatcc tatcgggtgc 2 94 0 
tggtggtcga agatcagccg tttcagcgcg aatacctgct caacctgttt cgcgagcgcg 3000 
gcgtgcagta cctggtaggt gccggcgacg gcgcggaggc gttgcgctgc ctgaagcagg 3 060 
acaggttcga cctgatcctc agcgatctga tgatgccggg catggatggt atccaaatga 3120 
tcctgcaact gccgtatctc aagcatcgtc cgaagctggc gctgatgagc tcctcgtcgc 3180 
agcggatgat gctcagtgcc agccgggtcg cccagagtct cggcttgtcg gtaatcgacc 3240 
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tqttgcccaa gccgactctg cccaaggcca tcggccaact tctggaacac ctggaaagat 3300 
gcctcaggca gaagctggag ccggaaaccg acgagactcc gcatgggcgc acggcgttgc 3360 
tggatgccct gcataacgag caactggtga cctggttcca ggctaagaaa tccctccaca 3420 
ccgggcgcat agtcggcgcc gaggcgttga tacgctggag ccacccgcag catggcctgt 3480 
tgctgcccag ctgtttcatg agtgatgtcg acgctaccgg tctgcacgag gcgttgctct 3540 
qgcgcgtgct cgaacagacc ctgaacgccc aggaatcgtg gcgcagggcg ggttacgaga 3600 
ttccggtttc ggtgaatctg ccgccgcacc tgctcgataa ccaggaactt ccggatcgac 3660 
tctatgagta cgtcggcgct cgcggggctt gtaccagctc actatgtttc gagttgaccg 3720 
agagcagtgt cacaactctg tcaagtaact actatgcagg tgcctgtcgc ttgcgcatga 3780 
aagggttcgg attggcccag gacgactttg gccagggtta cagctcgttc tataacctgg 3840 
tcacgacgcc tttcacggag ctgaagatcg accgctccct agtccaggga tgcgtagagg 3900 
ataacggcct caatgcagct gtcatcagtt gtattgagtt gggtcaccgc ctgaatctcg 3960 
acgtggtggc cgaaggcgtg gagacctgcg aggaactgaa tcttcttcgt cgtcttggct 4020 
gcgaccgggc gcagggtttc ctgatttcta aggcagtgtc tgctcgtgag ttcgagcggc 4080 
agttaaggga ggacggcccc agcctccttg ttfiUcgcag tatccccatt atcgcggagt 4140 
cgatcgcagc caaccaccgt cagcgcaaca gtgtcgctga aggaggccgc atcccgtgaa 4200 
gtctgctagt gccttggagc acgacaacaa acttttgctc aaatggacaa ccctctcgca 4260 
gagcctgagc a'tcggcttga tctgtgtggt ggtgctgacc gtattgctgt tcagcatctg 4320 
ttactggtcg ctggggagat tgtttcagga ggaggaggac aaagtctcct tccacttcac 4380 
ccgtEHB gatgttatac gggagcatga ggtatttctt gggcgcatcg ctcgaaaaag 4440 
cgacaagacc acccagaagt acgactatga cgtggtgcct ttgcagcggc acttgttggc . 4500. 
aaaggaaaac ggattagcgg tctatgaggg acgggagttt tcctttgctg Hccatttct 4560 
actggctacc aagcacgcgt tgagcgccga ttcctcggga gatccgtttt cgctcggtgt 4620 
attgctcgcc aatttctacg gaagcttctg gagtgtttcc gcctatcccg cgccacagtt 4680 
actgatcttt gatctttccg gcagcacccg cctggcagtg ccgtcgattc cctccacagc 474 0 
qcagcgtgac aggttgagcg gaagctatcc gggatagtc gagcgcattc tggcgcgctt 4800 
qcgcacccgg ccggtggggg aggacgctca gcgtgtccat tggatacgcg ctgatcgcta 4860 
tcgcgactcg gcgctggagf ^ttgggagt cgcccgggtt gatctgccgg aaacactctg 492.0 
gtggcacgac gagccgaacc atctgatcat cgctgcgagc ctgcttgatc tcaggcgaat 4980 
caatgacttc gaacagttgg ttgagcgccc ggcattcgat tcgtacagcc tggtatcgcc 504 0 
ggatggcgag gtattgctcg gcgcggcccc tgcgaccggc ctgagggatg gcctgaacct 5100 
cacccgacag ggggtcgccg ttcaactgcg cagccagcct gagaacggct ggctcgcggt 5160 
ctaccgaacc gactacggca atttctttcg ccactcccgg tggctggtgg caggtctgct 5220 
gctgaccccg gcgctgctcc tggccggttg gctcggg|H cgttggtaca ccagcagcgt 5280 
cgtcaacccg gtgcatcggg cgcaccggca actggtggag agcgacacct tcagccggac 534 0 
gctgatacag accgcgccgg tggctctggt ggtgctgacc caggatgacc agcaactggt 5400 
gacctgcaac cacttggccg cccagtggct gggcgggccc acggagatcc ttgggctgac 5460 
ttccaactgg aagcttttcg atgcgcgtgg gcaggtacca ggagaca.tct gtatccaggt 5520 
cggtgggcgc tatttgcaga ccgccttcgc ggcgacccgc tatgccggca ccgaggcggt 5580 
actgtgcgta ttcaacgaca tcacggtcca ctgcgaggcg gagaccgcgc tgtccaatgc 5640 
gaagcgagca gcggatgccg ccagccaggc caagaccctg ttcctggccc gcatgagcca 5700 
tgaaatccgt actcccctgt acggtgtcct tggcaccctg gagttgctcg acctgaccac 5760 
cctgaacgag cggcaacgcg cctacctacg caccatccag agttcgtctg cgacgctcat 5820 
gcaactgatt agcgatgtgc tggatgtctc gaagatcgaa gcggggcaga tggctctgac 5880 
cctggccgcc ttcaatccgc tggacctagt gcgggaagtg cttggcaact ttgccgccag 5940 
cgccatggcc aaggacctgc aggtagaccc gctcgatact cttgcgcttg aggcgcaggt 6000 
cgcgcatggc ttcgaagaaa gcgttctgtt cgaggttgct ggtggctcgg tcggccattt 6060 
cgaagagggt gtcgtcggcg ttgtcgaaca acgcctgcaa cgcctgtttc agctgcagcg 6120 
ccgccttgtc gcgcacctgc acgaggatga ccggcaggcg ccccgctccg gcgttcggcg 6180 
acggctcgga agcgaccctg gtcaggtgca ccacattggc atcgttctgc atcgggactc 6240 
tcctgccacc ctcgcggccg cgcatggaat ggcaaaaatc gggcacagag gatcgattgg 63 00 
cgtcgtccgt aacgtcaatt tccaggcgtc aaaaacaagt atctacattc attatagaga 6360 
tactttcaaa tctagafgla gggttttcct atagacatga ctctcttcac 6410 
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Figure 6A 

ORF1-1 

SEQ ID NO: 8 * 

atgaagttga agaatttctt acagcctttt gatagcggtt tctccactcc gagtgctgcg 60 

ctcaagctgc tccgcatgct cggtggcgcc ttgatgttgt gcgtgctatg cagcctgata 120 

ttcagtgtga gcatggtttt aaaccatcag gtgtccctca gtcggcaagc tatgaatgtg 180 

gctatgtacg aagcgcagct ttatttcgag cagcgcgagg cgttgctcaa tcacttgagc 240 

ggcaatgtcg tgcccttggc cgcgggtaga gcgctcgtca acgaagcgcc gaacaatgtg 300 

agcatcctgc cgttgagtga cggagggcga ggtctgctat tgaccgctcg cacgctcggt 360 

gatctccggg aaaagcggct ggcactgatg tatctggtcg ataccgacaa aggccctctg 420 

gtttaccggc ttaccgccga tggtaggccc tcggcagcga tatccagcac gataaccaaa 480 

gaggtgtacc gagccttgct ggcgactccg tcggcgcctg ttcactgggt gactgacggt 540 

ggtacccctc aacggctgta cctttttgaa tccttaggcg atgagccggg cgaggggtgg 600 

ctaggcctgg agattctcgg cgaagacctc gattcgatgt tgcgccggaa tgatgccgga 660 

aactacatgc tgctggatca gcatgggcag gtcgtactcg ctacggacgc agaggcgctg 720 

gggagcggtg cgtcgcggac gcttttgcgt ggagacggct tcggtttcat cggtgctggc 780 

ccactgccgc agcatatggt gcttttccag cacgtggggt cttcgagctg ggatctgatc 840 

tatcacatcg gtatcggtcg cctgttgctg gctctgtggc tccctctgtt acttgcctct 900 

gcgttggcac tcgcagtcgg catcctactg cattggctgg tgcggagcat cgagcgacgc 960 

ttgatagagc ccgcaaagcg acgccttgaa gcattgaagg agagcgaagc cttttcccgt 1020 

gcagttatcc aggccgcgcc cgtcgcgctg tgcgtgctgc gtcgtgccga cgccgcagtg 1080 

gtcctggaaa atccccaggc gcgccaatgg ctgggtgata gcgaggcgat tgcccacgac 1140 

gcgccgagat ggatttccca ggcgttcgca ggaggtgtga agtgttctgg agaagaactg 1200 

gaaaccgagg cagggctaca tcttcatctc aattacacgc ccacccgcta taacggtgaa 1260 

gacgtattgt tctgcgcctt cagtgaaatc agtgcacgca agcggatgga ggcggaactg 1320 

gctcgcgcaa aatccctggc ggatgctgcc aatgaagcca agacgctgtt tctcgccacc .1380 

atgagccatg aaatccgcac acctctgtac ggcatgcttg gcacgcttga gctgcttggg 1440 

cgtaccgagc tgagtcggca gcaggccggt tacctaaagg caatccagca ttcctcgtcg 1500 

accctgctgc aactgatcag cgatgtgctt gacgtatcca agatagaggc cggccaactg 1560 

gacctagagt gcgtggaatt ctccccgctg gaattgaccg aagaggtcgt gcagtcgttc 1620 

accggtgccg cgcaggccaa ggggctgcag ttgtatacct gcctctctgc ggagctgccg 1680 

ctgcgcatgc ggggggccgc ggcgtcgatc cggcagattc tcaacaacct gctgagcaac 1740 

gcggtgaagt tcaccgacaa tggctatgtc aacgtccacc tgaaggccag cgtggtcgat 1800 

gccgaatgtg tgatgctgac ctggcaggtc aacgataccg gcatcjgggat caacgtcgag 1860 

gatcagccgc gtctgttcga accgttctac cagatacgcc gctccgagca tccggtcgca 1920 

ggcacgggcc tcggcttgtc gatcagccag cgcctggcgc agctaatgaa' tggcagtctg 1980 

aaactggtca gtgagctggg gttgggcagc agctttagcc tcagg'cttcc gcttgagcgg 2040 

atcgcgatgc aggctgagcc gcaggaccta gccgggtgcg ccgtccaagt gctggcgccf 2100 

gtccgcgacc taacggaatg cctgtgtggc tggatctccc gctggggtgg aa'gggccatg 2160 

gtcgcgacgc cgaggtcgct ggacgaggcg gacgcgacct cgctgctggt cgaagtgtta 2220 

ctgctggagg gggcgccgat gttcgaagca tggccaggat gccgggtgga gctttcccct 2280 

cagggtgata tggagccgca ggcacagggc cgcgactggc tgctcgggct caacaacctg 2340 

gacggcctgc atcgtgctct gggcctggcc catgggcgtc tcgctgatcc ttcgacgccg 24 00. 

ccgatacggc tggctccgtt gcgcaatcta ggtctccgcg tcctagtggt ggaggataac 2460 

gcgatcaacc agttgatctt gagggaccag atggaagcgc tgggctgcag cgtggagctg 2520 

ctcttcgatg gtcgcgaggc gttgctgcac tgccagacgg cctgcttcga cgtggtgctc 2580 

accgatatca acatgccgaa catgaacgga tacgagctaa ccgcggagct acggcgccaa 2 64 0 

gggttccggc agccgatcat cggcgcgacg gcgaacgcca tgcgtgagga gcgcgagcgc 2700 

tgcatgtccg ccgggatgaa cgattgcctg gtcaaaccgg tggatctgaa tgcccttcag 2760 

aactgcttga ttaatattct caaggtggat cgatga 2796 
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Figure 6B 

ORF1-2 

SEQ ID No: 9 



st-actcqqtq gcgccttgat gttgtgcgtg ctatgcagcc tgatattcag tgtgagcatg 60 
StttaHc? SSaggtgtc cctcagtcgg caagctatga atgtggctat gtacgaagcg 120 
SaSSItt tccaqcaLg cgagglgttg ctcaatcact tgagcggcaa tgtcgtgccc 180 
ttqqccgcgg gSgagcgc? cgtcLlgal gcgccgaaca atgtgagcat octgccgttg 240 
aaSaSSl ggciaggtct gctattgacc gctcgcacgc tcggtgatct ccgggaaaag 300 
c^qctgglac t|Sg?Jtct Sgtcgatacc gacaaaggcc ctctggttta ccggcttacc 360 
olcSggta ggccltcggc agcgatatcc agcacgataa ccaaagaggt gtaccgagcc 420 
ttqrtgglga clccgtcggc gcctgttcac tgggtgactg acggtggtac ccctcaacgg 480 
ctSccttt ttgaatccEt aggcgatgag ccgggcgagg ggtggctagg cctggagatt 540 
ctcggcgaag acctcgattc gatgttgcgc cggaatgatg ccggaaacta catgctgctg 600 
qatcagcatg ggcaggtcgt actcgctacg gacgcagagg cgctggggag cggtgcgtcg 660 
S£2cttt tgcgtggaga cggcttcggt ttcatcggtg ctggcccact gccgcagcat 720 
aSStgSt SSgSegt gSgtetSg agctgggatc tgatctatca catcggtatc 780 
wtcqSctgt tgctggctct gtggctccct ctgttacttg cctctgcgtt ggcactcgca 840 
Sweatee tlctpattg gctggtgcgg agcatcgagc gacgcttgat agagcccgca 900 
aaqcgacgcc ttgalgcatt gaaggagagc gaagcctttt cccgtgcagt tatccaggcc 960 
gcgcccglcg cgctglgcgt gctgcgtcgt gccgacgccg cagtggtcct g9""tccc 1020 
caggcgcgcc aatggctggg tgatagcgag gcgattgccc acgacgcgcc gagatggatt 1080 
tcccaggcgt tcgcaggagg tgtgaagtgt tctggagaag aactggaaac cgaggcaggg 1140 
SS atctclatll cacgcccacc cgctataacg gtgaagacgt attgttctgc 1200 
gccttcagtg aaatcagtgc acgcaagcgg atggaggcgg aactggctcg cgcaaaatcc 1260 
ctggcggatl ctgccaatga agccaagacg ctgtttctcg ccaccatgag ccatgaaatc 1320 
cgcacacctc tglacggcat gcttggcacg cttgagctgc ttgggcgtac cgagctgagt 1380 
cqqcagcagg ccggtiacct aaaggcaatc cagcattcct cgtcgaccct gctgcaactg 1440 
atcagcgatg tgcStgacgt atccaagata gaggccggcc aactggacct agagtgcgtg 1500 
Saattctcc? cgctggaatt gaccgaagag gtcgtgcagt cgttcaccgg tgccgcgcag 1560 
qccaaggggc tlcagttgta tacctgcctc tctgcggagc tgccgctgcg catgcggggg 1620 
qccgcS cgatccggca gattctcaac aacctgctga gcaacgcggt gaagttcacc 1680 
gacaatggft atgtcallgt ccacctgaag gccagcgtgg tcgatgccga atgtgtgatg 1740 
ctgacctggc aggtcaacga taccggcatg gggatcaacg tcgaggatca gccgcgtctg 1800 
ttcgaaccgt tc?accagat acgccgctcc gagcatccgg tcgcaggcac 3 9 gcctcggc 1860 
^ateaatca qccagcgcct ggcgcagcta atgaatggca gtctgaaact ggtcagtgag 1920 
cSgggt^g Kagcagctt ?agcctcagg cttccgcttg agcggatcgc gatgcaggct 1980 
gagccgcagg acctagccgg gtgcgccgtc caagtgctgg cgcctgtccg cgacctaacg 2040 
qaatqcctgt gtggctggat ctcccgctgg ggtggaaggg ccatggtcgc gacgccgagg 2100 
?cgctggacg aggcggacgc gacctlgctg ctggtcgaag tgttactgct ggagggggcg 2160 
ccSgttcg aagcaEggcc aggatgccgg gtggagcttt cccctcaggg tgatatggag 2220 
ccgcaggcac agggccgcga ctggctgctc gggctcaaca acctggacgg cctgcatcgt 2280 
gc?ctgggcc tggcccatgg gcgtctcgct gatccttcga cgccgccgat acggctggct 2340 
ccgttgcgca atctaggtct ccgcgtccta gtggtggagg ataacgcgat caaccagttg 2400 
' atcttgaigg accagatgga agcgctgggc tgcagcgtgg agctgctctt cgatggtcgc 2460 
gaggcgttic tgcactgcca gacggcctgc ttcgacgtgg tgctcaccga tatcaacatg 2520 
IcgLIatga acggatacga gctaaccgcg gagctacggc gccaagggtt ccggcagccg 2580 
atcatcggcg cgacggcgaa cgccatgcgt gaggagcgcg agcgctgcat gtccgccggg 2640 
atgaacgat? glctgltcaa accggtggat ctgaatgccc ttcagaactg ettg.tt.at 2700 
attctcaagg tggatcgatg a 
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Figure 6C 

ORF1-3 

SEQ ID NO: 10 ' 
atgttgtgcg tgctatgcag cctgatattc agtgtgagca tggttttaaa ccatcaggtg 60 
tccctcagtc ggcaagctat gaatgtggct atgtacgaag cgcagcttta tttcgagcag 120 
cqcgaggcgt -tgctcaatca cttgagcggc aatgtcgtgc ccttggccgc gggtagagcg 180 
ctcgtcaacg aagcgccgaa caatgtgagc atcctgccgt tgagtgacgg agggcgaggt 240 
ctgctattga ccgctcgcac gctcggtgat ctccgggaaa agcggctggc actgatgtat 300 
ctggtcgata ccgacaaagg ccctctggtt taccggctta ccgccgatgg taggccctcg 360 
gcagcgatat ccagcacgat aaccaaagag gtgtaccgag ccttgctggc gactccgtcg 420 
gcgcctgttc actgggtgac tgacggtggt acccctcaac ggctgtacct ttttgaatcc 4 80 
ttaggcgatg agccgggcga ggggtggcta ggcctggaga ttctcggcga agacctcgat 540 
tcgatgttgc gccggaatga tgccggaaac tacatgctgc tggatcagca tgggcaggtc 600 
gtactcgcta cggacgcaga ggcgctgggg agcggtgcgt cgcggacgct tttgcgtgga 660 
gacggcttcg gtttcatcgg tgctggccca ctgccgcagc atatggtgct tttccagcac 720 
gtggggtctt cgagctggga tctgatctat cacatcggta tcggtcgcct gttgctggct 780 
ctgtggctcc ctctgttact tgcctctgcg ttggcactcg cagtcggcat cctactgcat 840 
tggctggtgc ggagcatcga gcgacgcttg atagagcccg caaagcgacg ccttgaagca 900 
ttgaaggaga gcgaagcctt ttcccgtgca gttatccagg ccgcgcccgt cgcgctgtgc 960 
gtgctgcgtc gtgccgacgc cgcagtggtc ctggaaaatc cccaggcgcg ccaatggctg 1020 
ggtgatagcg aggcgattgc ccacgacgcg ccgagatgga tttcccaggc gttcgcagga 1080 
ggtgtgaagt gttctggaga agaactggaa accgaggcag ggctacatct . tcatctcaat 1140 
tacacgccca cccgctataa cggtgaagac gtattgttct gcgccttcag tgaaatcagt 1200 
gcacgcaagc ggatggaggc ggaactggct cgcgcaaaat ccctggcgga tgctgccaat 1260 
gaagccaaga cgctgtttct cgccaccatg agccatgaaa tccgcacacc tctgtacggc 1320 
atgcttggca cgcttgagct gcttgggcgt accgagctga gtcggcagca ggccggttac 1380 
ctaaaggcaa tccagcattc ctcgtcgacc ctgctgcaac tgatcagcga tgtgcttgac 1440 
gtatccaaga tagaggccgg ccaactggac ctagagtgcg tggaattctc cccgctggaa 1500 
ttgaccgaag aggtcgtgca gtcgttca-cc ggtgccgcgc aggccaaggg gctgcagttg 1560 
tatacctgcc tctctgcgga gctgccgctg cgcatgcggg gggccgcggc gtcgatccgg 1620 
cagattctca acaacctgct gagcaacgcg gtgaagttca ccgacaatgg ctatgtcaac 1680 
gtccacctga aggccagcgt ggtcgatgcc gaatgtgtga tgctgacctg gcaggtcaac 1740 
gataccggca tggggatcaa cgtcgaggat cagccgcgtc tgttcgaacc gttctaccag 1800 
atacgccgct ccgagcatcc ggtcgcaggc acgggcctcg gcttgtcgat cagccagcgc 1860 
ctggcgcagc taatgaatgg cagtctgaaa ctggtcagtg agctggggtt gggcagcagc 1920 
tttagcctca ggcttccgct tgagcggatc gcgatgcagg ctgagccgca ggacctagcc 1980 
gggtgcgccg tccaagtgct ggcgcctgtc cgcgacctaa cggaatgcct gtgtggctgg 2040 
atctcccgct ggggtggaag ggccatggtc gcgacgccga ggtcgctgga cgaggcggac 2100 
gcgacctcgc tgctggtcga agtgttactg ctggaggggg cgccgatgtt cgaagcatgg 2160' 
ccaggatgcc gggtggagct ttcccctcag ggtgatatgg agccgcaggc acagggccgc 2220 
gactggctgc tcgggctcaa caacctggac ggcctgcatc gtgctctggg cctggcccat 2280 
gggcgtctcg ctgatccttc gacgccgccg atacggctgg ctccgttgcg caatctaggt- 2340 
ctccgcgtcc tagtggtgga ggataacgcg atcaaccagt tgatcttgag ggaccagatg 2400 
gaagcgctgg gctgcagcgt ggagctgctc ttcgatggtc gcgaggcgtt gctgcactgc 2460 
cagacggcct gcttcgacgt ggtgctcacc gatatcaaca . tgccgaacat gaacggatac 2520 
gagctaaccg cggagctacg gcgccaaggg ttccggcagc cgatcatcgg cgcgacggcg 2580 
aacgccatgc gtgaggagcg cgagcgctgc atgtccgccg ggatgaacga ttgcctggtc 2640 
aaaccggtgg atctgaatgc ccttcagaac tgcttgatta atattctcaa ggtggatcga 2700 
tga 2703 
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Figure 6D 

ORF1-4 • 

atqgttttaa accatcaggt gtccctcagt cggcaagcta tgaatgtggc tatgtacgaa 60 
qcgcagcttt atttcgagca gcgcgaggcg ttgctcaatc acttgagcgg caatgtcgtg 120 
cccttggccg cgggtagagc gctcgtcaac gaagcgccga acaatgtgag catcctgccg 180 
ttgagtgacg gagggcgagg tctgctattg accgctcgca cgctcggtga tctccgggaa 240 
SSSgS cacliatgta tctggtcgat accgacaaag gccctctggt ttaccggctt 300 
accqccgatg gtaggccctc ggcagcgata tccagcacga taaccaaaga ggtgtaccga 360 
qccttgctgg cgactccgtc ggcgcctgtt cactgggtga ctgacggtgg tacccctcaa 420 
cqqctgtacc tttttgaatc cttaggcgat gagccgggcg aggggtggct aggcctggag 4 80 
attctcggcg aagacctcga ttcgatgttg cgccggaatg atgccggaaa ctacatgctg 54 0 
ctqgatcagc atgggcaggt cgtactcgct acggacgcag aggcgctggg gagcggtgcg 600 
tcqcggacgc ttttgcgtgg agacggcttc ggtttcatcg gtgctggccc actgccgcag 660 
catatggtgc ttttccagca cgtggggtct tcgagctggg atctgatcta tcacatcggt 720 
atcggtcgcc tgttgctggc tctgtggctc- cctctgttac ttgcctctgc gttggcactc 780 
qcagtcggca tcctactgca ttggctggtg cggagcatcg agcgacgctt gatagagccc 840 
qcaaagcgac gccttgaagc attgaaggag agcgaagcct tttcccgtgc agttatccag -900 
gccgcgcccg tcgcgctgtg cgtgctgcgt cgtgccgacg ccgcagtggt cctggaaaat 960 . 
ccccaggcgc gccaatggct gggtgatagc gaggcgattg cccacgacgc gccgagatgg 1020 
atttcccagg cgttcgcagg aggtgtgaag tgttctggag aagaactgga aaccgaggca 1080 
gggctacatc ttcatctcaa ttacacgccc acccgctata acggtgaaga cgtattgttc 1140 
tgcgccttca gtgaaatcag tgcacgcaag cggatggagg cggaactggc tcgcgcaaaa 1200 
tccctggcgg atgctgccaa tgaagccaag acgctgtttc tcgccaccat gagccatgaa 1260 
atccgcacac ctctgtacgg catgcttggc acgcttgagc tgcttgggcg taccgagctg 1320 
agtcggcagc aggccggtta cctaaaggca atccagcatt cctcgtcgac cctgctgcaa 13 80 
ctgatcagcg atgtgcttga cgtatccaag atagaggccg gccaactgga cctagagtgc 1440 
gtggaattct ccccgctgga attgaccgaa gaggtcgtgc agtcgttcac cggtgccgcg 1500 
caggccaagg ggctgcagtt gtatacctgc ctctctgcgg agctgccgct gcgcatgcgg 1560 
ggggccgcgg cgtcgatccg gcagattctc aacaacctgc tgagcaacgc ggtgaagttc 1620 
accgacaatg gctatgtcaa cgtccacctg aaggccagcg tggtcgatgc cgaatgtgtg 1680 
atgctgacct ggcaggtcaa cgataccggc atggggatca acgtcgagga- tcagccgcgt 1740 
ctgttcgaac cgttctacca gatacgccgc tccgagcatc cggtcgcagg cacgggcctc 1800 
ggcttgtcga tcagccagcg cctggcgcag ctaatgaatg gcagtctgaa actggtcagt 1860 
gagctggggt tgggcagcag ctttagcctc aggcttccgc ttgagcggat cgcgatgcag 1920 
gctgagccgc aggacctagc cgggtgcgcc gtccaagtgc tggcgcctgt ccgcgaccta 1980 
acggaatgcc tgtgtggctg gatctcccgc tggggtggaa gggccatggt cgcgacgccg 2040 
aggtcgctgg acgaggcgga cgcgacctcg ctgctggtcg aagtgttact gctggagggg 2100 
gcgccgatgt tcgaagcatg gccaggatgc cgggtggagc tttcccctca gggtgatatg 2160 
gagccgcagg cacagggccg cgactggctg ctcgggctca acaacctgga cggcctgcat 2220 
cgtgctctgg gcctggccca tgggcgtctc gctgatcctt cgacgccgcc gatacggctg 2280 
gctccgttgc gcaatctagg tctccgcgtc ctagtggtgg aggataacgc gatcaaccag 2340 
ttgatcttga gggaccagat ggaagcgctg ggctgcagcg tggagctgct cttcgatggt 2400 
cgcgaggcgt tgctgcactg ccagacggcc tgcttcgacg tggtgctcac cgatatcaac 2460 
atgccgaaca tgaacggata cgagctaacc gcggagctac ggcgccaagg gttccggcag 2520 
ccgatcatcg gcgcgacggc gaacgccatg cgtgaggagc gcgagcgctg catgtccgcc 2580 
gggatgaacg attgcctggt caaaccggtg gatctgaatg cccttcagaa ctgcttgatt 2640 
aatattctca aggtggatcg atga 2664 
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Figure 6E 

ORF1-5 

SEQ ID NO: 12 

atgaatgtgg ctatgtacga agcgcagctt tatttcgagc agcgcgaggc gttgctcaat 60 
cacttgagcg gcaatgtcgt gcccttggcc gcgggtagag cgctcgtcaa cgaagcgccg 120 
aacaatgtga gcatcctgcc gttgagtgac ggagggcgag gtctgctatt gaccgctcgc 180 
acgctcggtg atctccggga aaagcggctg gcactgatgt atctggtcga taccgacaaa 240 
ggccctctgg tttaccggct taccgccgat ggtaggccct cggcagcgat atccagcacg 300 
ataaccaaag aggtgtaccg agccttgctg gcgactccgt cggcgcctgt tcactgggtg 360 
actgacggtg gtacccctca acggctgtac ctttttgaat ccttaggcga tgagccgggc 420 
gaggggtggc taggcctgga gattctcggc gaagacctcg attcgatgtt gcgccggaat 480 
gatgccggaa actacatgct gctggatcag catgggcagg tcgtactcgc tacggacgca 540 
gaggcgctgg ggagcggtgc gtcgcggacg cttttgcgtg gagacggctt cggtttcatc 600 
ggtgctggcc' cactgccgca gcatatggtg cttttccagc acgtggggtc ttcgagctgg 660 
gatctgatct atcacatcgg tatcggtcgc ctgttgctgg ctctgtggct ccctctgtta 720 
cttgcctctg cgttggcact cgcagtcggc atcctactgc attggctggt gcggagcatc 780 
gagcgacgct tgatagagcc cgcaaagcga cgccttgaag cattgaagga gagcgaagcc 840 
ttttcccgtg cagttatcca ggccgcgccc gtcgcgctgt gcgtgctgcg tcgtgccgac 900 
gccgcagtgg tcctggaaaa tccccaggcg cgccaatggc tgggtgatag cgaggcgatt 960 
gcccacgacg cgccgagatg gatttcccag gcgttcgcag gaggtgtgaa gtgttctgga 1020 
gaagaactgg aaaccgaggc agggctacat cttcatctca attacacgcc cacccgctat 1080 
aacggtgaag acgtattgtt ctgcgccttc agtgaaatca gtgcacgcaa gcggatggag 1140 
gcggaactgg ctcgcgcaaa atccctggcg gatgctgcca atgaagccaa gacgctgttt 1200 
ctcgccacca tgagccatga aatccgcaca cctctgtacg gcatgcttgg cacgcttgag 1260 
ctgcttgggc gtaccgagct gagtcggcag caggccggtt acctaaaggc aatccagcat 1320 
tcctcgtcga ccctgctgca actgatcagc gatgtgcttg acgtatccaa gatagaggcc 1380 
ggccaactgg acctagagtg cgtggaattc tccccgctgg aattgaccga agaggtcgtg 1440 
cagtcgttca ccggtgccgc gcaggccaag gggctgcagt tgtatacctg cctctctgcg 1500 
gagctgccgc tgcgcatgcg gggggccgcg gcgtcgatcc ggcagattct caacaacctg 1560 
ctgagcaacg cggtgaagtt caccgacaat ggctatgtca acgtccacct gaaggccagc 1620 
gtggtcgatg ccgaatgtgt gatgctgacc tggcaggtca acgataccgg catggggatc 1680 
aacgtcgagg. atcagccgcg tctgttcgaa ccgttctacc agatacgccg ctccgagcat 1740 
ccggtcgcag gcacgggcct cggcttgtcg atcagccagc gcctggcgca gctaatgaat 1800 
ggcagtctga aactggtcag tgagctgggg ttgggcagca gctttagcct caggcttccg 1860 
cttgagcgga tcgcgatgca ggctgagccg caggacctag ccgggtgcgc cgtccaagtg 1920 
ctggcgcctg tccgcgacct aacggaatgc ctgtgtggct ggatctcccg ctggggtgga 1980 
agggccatgg tcgcgacgcc gaggtcgctg gacgaggcgg acgcgacctc gctgctggtc 2040 
gaagtgttac tgctggaggg ggcgccgatg ttcgaagcat ggccaggatg ccgggtggag 2100 
ctttcccctc agggtgatat ggagccgcag gcacagggcc gcgactggct gctcgggctc 2160 
aacaacctgg acggcctgca tcgtgctctg ggcctggccc atgggcgtct cgctgatcct 2220 
tcgacgccgc cgatacggct ggctccgttg cgcaatctag gtctccgcgt cctagtggtg 2280 
gaggataacg cgatcaacca gttgatcttg agggaccaga tggaagcgct gggctgcagc 2340 
gtggagctgc tcttcgatgg tcgcgaggcg ttgctgcact gccagacggc ctgcttcgac 24 00 
gtggtgctca ccgatatcaa catgccgaac atgaacggat acgagctaac cgcggagcta 24 60 
cggcgccaag ggttccggca gccgatcatc ggcgcgacgg cgaacgccat gcgtgaggag 2520 
cgcgagcgct gcatgtccgc cgggatgaac gattgcctgg tcaaaccggt ggatctgaat 2580 
gcccttcaga actgcttgat taatattctc aaggtggatc gatga 2625 
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Figure 6F 

ORF1-6 

SEQ ID NO: 13 

atgtacgaag cgcagcttta tttcgagcag cgcgaggcgt tgctcaatca cttgagcggc 60 
aatgtcgtgc ccttggccgc gggtagagcg ctcgtcaacg aagcgfccgaa caatgtgagc 120 
atcctgccgt tgagtgacgg agggcgaggt ctgctattga ccgctcgcac gctcggtgat 180 
ctccgggaaa agcggctggc actgatgtat ctggtcgata ccgacaaagg ccctctggtt 240 
taccggctta ccgccgatgg taggccctcg gcagcgatat ccagcacgat aaccaaagag 300 
qtgtaccgag ccttgctggc gactccgtcg gcgcctgttc actgggtgac tgacggtggt 360 
acccctcaac ggctgtacct ttttgaatcc ttaggcgatg agccgggcga ggggtggcta 420 
ggcctggaga ttctcggcga agacctcgat tcgatgttgc gccggaatga tgccggaaac 480 
tacatgctgc tggatcagca tgggcaggtc gtactcgcta cggacgcaga ggcgctgggg 540 
agcggtgcgt cgcggacgct tttgcgtgga gacggcttcg gtttcatcgg tgctggccca 600 ■ 
ctgccgcagc atatggtgct tttccagcac gtggggtctt cgagctggga tctgatctat 660 
cacatcggta tcggtcgcct gttgctggct ctgtggctcc ctctgttact tgcctctgcg 720 
ttggcactcg cagtcggcat cctactgcat tggctggtgc ggagcatcga gcgacgcttg 780 
atagagcccg caaagcgacg ccttgaagca ttgaaggaga gcgaagcctt ttcccgtgca 840 
gttatccagg ccgcgcccgt cgcgctgtgc gtgctgcgtc gtgccgacgc cgcagtggtc 900 
ctggaaaatc cccaggcgcg ccaatggctg ggtgatagcg aggcgattgc ccacgacgcg 960 
ccgagatgga tttcccaggc gttcgcagga ggtgtgaagt gttctggaga agaactggaa 1020 
accgaggcag ggctacatct tcatctcaat tacacgccca cccgctataa cggtgaagac 1080 
gtattgttct gcgccttcag tgaaatcagt gcacgcaagc ggatggaggc ggaactggct 1140 
cgcgcaaaat ccctggcgga tgctgccaat gaagccaaga cgctgtttct cgccaccatg 1200 
agccatgaaa tccgcacacc tctgtacggc atgcttggca cgcttgagct gcttgggcgt 1260 
accgagctga gtcggcagca ggccggttac ctaaaggcaa tccagcattc ctcgtcgacc 1320 
ctgctgcaac tgatcagcga tgtgcttgac gtatccaaga tagaggccgg ccaactggac 13 80 
ctagagtgcg tggaattctc cccgctggaa ttgaccgaag aggtcgtgca gtcgttcacc 14*40 
ggtgccgcgc aggccaaggg gctgcagttg tatacctgcc tctctgcgga gctgcc$ctg 1500 
cgcatgcggg gggccgcggc gtcgatccgg cagattctca acaacctgct gagcaacgcg 1560 
gtgaagttca ccgacaatgg ctatgtcaac gtccacctga aggccagcgt ggtcgatgcc 1620 
gaatgtgtga tgctgacctg gcaggtcaac gataccggca tggggatcaa cgtcgaggat 1680 
cagccgcgtc tgttcgaacc gttctaccag atacgccgct ccgagcatcc ggtcgcaggc 1740 
acgggcctcg gcttgtcgat cagccagcgc ctggcgcagc taatgaatgg cagtctgaaa 1800 
ctggtcagtg agctggggtt gggcagcagc tttagcctca ggcttccgct tgagcggatc 1860 
gcgatgcagg ctgagccgca ggacctagcc gggtgcgccg tccaagtgct ggcgcctgtc 1920 
cgcgacctaa cggaatgcct gtgtggctgg atctcccgct ggggtggaag ggccatggtc 1980 
gcgacgccga ggtcgctgga cgaggcggac gcgacctcgc tgctggtcga agtgttactg 2040 
ctggaggggg cgccgatgtt cgaagcatgg ccaggatgcc gggtggagct ttcccctcag 2100 
ggtgatatgg agccgcaggc acagggccgc gactggctgc tcgggctcaa caacctggac 2160 
ggcctgcatc gtgctctggg cctggcccat gggcgtctcg ctgatccttc gacgccgccg 2220 
atacggctgg ctccgttgcg caatctaggt ctccgcgtcc tagtggtgga ggataacgcg 2280 
atcaaccagt tgatcttgag ggaccagatg gaagcgctgg gctgcagcgt ggagctgctc 2340 
ttcgatggtc gcgaggcgtt gctgcactgc cagacggcct gcttcgacgt ggtgctcacc 2400 
gatatcaaca tgccgaacat gaacggatac gagctaaccg cggagctacg gcgccaaggg 2460 
ttccggcagc cgatcatcgg cgcgacggcg aacgccatgc gtgaggagcg cgagcgctgc 2520 
atgtccgccg ggatgaacga ttgcctggtc aaaccggtgg atctgaatgc ccttcagaac 2580 
tgcttgatta atattctcaa ggtggatcga tga 2613 
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Figure 6G 



0RP1-7 

SEQ ID NO: 14 



atatatctgg tcgataccga caaaggccct ctggtttacc ggcttaccgc cgatggtagg 60 
ccctcggcag cgatatccag cacgataacc aaagaggtgt accgagcctt gctggcgact 120 
ccqtcggcgc ctgttcactg ggtgactgac ggtggtaccc ctcaacggct gtaccttttt 180 
gaatccttag gcgatgagcc gggcgagggg tggctaggcc tggagattct cggcgaagac 240 
ctcqattcga tgttgcgccg gaatgatgcc ggaaactaca tgctgctgga tcagcatggg 300 
caqqtcgtac tcgctacgga cgcagaggcg ctggggagcg gtgcgtcgcg gacgcttttg 360 
cqtqqagacg gcttcggttt catcggtgct ggcccactgc cgcagcatat ggtgcttttc 420 
cLcacgtgg ggtcttcgag ctgggatctg atctatcaca tcggtatcgg tcgcctgttg 480 
ctqqctctgt ggctccctct gttacttgcc tctgcgttgg cactcgcagt cggcatccta 540 
ctgcattggc tggtgcggag catcgagcga cgcttgatag agcccgcaaa gcgacgcctt 600 
qaagcattga aggagagcga agccttttcc cgtgcagtta tccaggccgc gcccgtcgcg 660 
ctqtgcgtgc tgcgtcgtgc cgacgccgca gtggtcctgg aaaatcccca ggcgcgccaa 720 
tggctgggtg atagcgaggc gattgcccac ■ gacgcgccga gatggatttc ccaggcgttc 780 
qcaqqaggtg tgaagtgttc tggagaagaa ctggaaaccg aggcagggct acatcttcat 840 
ctcaattaca cgcccacccg ctataacggt gaagacgtat tgttctgcgc cttcagtgaa 900 
atcagtgcac gcaagcggat ggaggcggaa ctggctcgcg caaaatccct ggcggatgct 960 
qccaatgaag ccaagacgct gtttctcgcc accatgagcc atgaaatccg cacacctctg 1020 
tacggcatgc ttggcacgct tgagctgctt gggcgtaccg agctgagtcg gcagcaggcc 1080 
qqttacctaa aggcaatcca gcattcctcg tcgaccctgc tgcaactgat cagcgatgtg 114 0 
cttgacgtat ccaagataga . ggccggccaa ctggacctag agtgcgtgga attctccccg 1200 
ctqgaattga ccgaagaggt . cgtgcagtcg ttcaccggtg ccgcgcaggc caaggggctg 1260 
cagttgtata cctgcctctc tgcggagctg ccgctgcgca tgcggggggc cgcggcgtcg 1320 
atccggcaga ttctcaacaa cctgctgagc aacgcggtga agttcaccga caatggctat 13 80 
gtcaacgtcc acctgaaggc cagcgtggtc gatgccgaat gtgtgatgct gacctggcag 1440 
gtcaacgata ccggcatggg gatcaacgtc gaggatcagc cgcgtctgtt. cgaaccgttc 1500 
taccagatac gccgctccga gcatccggtc gcaggcacgg gcctcggctt gtcgatcagc 1560 
cagcgcctgg cgcagctaat gaatggcagt ctgaaactgg • tcagtgagct ggggttgggc 1620 
agcagcttta gcctcaggct tccgcttgag cggatcgcga tgcaggctga gccgcaggac 1680 
ctagccgggt gcgccgtcca agtgctggcg cctgtccgcg acctaacgga atgcctgtgt 174 0 
ggctggatct cccgctgggg tggaagggcc atggtcgcga cgccgaggtc gctggacgag .1800 
gcggacgcga cctcgctgct ggtcgaagtg ttactgctgg .agggggcgcc gatgttcgaa 1860 
gcatggccag gatgccgggt ggagctttcc cctcagggtg atatggagcc gcaggcacag 1920 
ggccgcgact ggctgctcgg gctcaacaac ctggacggcc tgcatcgtgc tctgggcctg 1980 
gcccatgggc gtctcgctga tccttcgacg ccgccgatac ggctggctcc gttgcgcaat 2040 
ctaggtctcc gcgtcctagt ggtggaggat aacgcgatca accagttgat cttgagggac 2100 
cagatggaag cgctgggctg cagcgtggag ctgctcttcg atggtcgcga ggcgttgctg 2160 
cactgccaga cggcctgctt cgacgtggtg ctcaccgata tcaacatgcc gaacatgaac 2220 
ggatacgagc taaccgcgga gctacggcgc caagggttcc ggcagccgat catcggcgcg 2280 
acggcgaacg ccatgcgtga ggagcgcgag cgctgcatgt ccgccgggat gaacgattgc 2340 
ctggtcaaac cggtggatct gaatgccctt cagaactgct tgattaatat tctcaaggtg 2400 
gatcgatga 2409 
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ORF1-8 

SEQ ID NO: 15 

atgttgcgcc ggaatgatgc cggaaactac 
ctcgctacgg acgcagaggc gctggggagc 
ggcttcggtt tcatcggtgc tggcccactg 
gggtcttcga gctgggatct gatctatcac 
tggctccctc tgttacttgc ctctgcgttg 
ctggtgcgga gcatcgagcg acgcttgata 
aaggagagcg aagccttttc ccgtgcagtt 
ctgcgtcgtg ccgacgccgc agtggtcctg 
gatagcgagg cgattgccca cgacgcgccg 
gtgaagtgtt ctggagaaga actggaaacc 
acgcccaccc gctataacgg tgaagacgta 
cgcaagcgga tggaggcgga actggctcgc 
gccaagacgc tgtttctcgc caccatgagc 
cttggcacgc ttgagctgct tgggcgtacc 
aaggcaatcc agcattcctc gtcgaccctg 
tccaagatag aggccggcca actggaccta 
accgaagagg tcgtgcagtc gttcaccggt 
acctgcctct ctgcggagct gccgctgcgc 
attctcaaca acctgctgag caacgcggtg 
cacctgaagg ccagcgtggt cgatgccgaa 
accggcatgg ggatcaacgt cgaggatcag 
cgccgctccg agcatccggt cgcaggcacg 
gcgcagctaa tgaatggcag tctgaaactg 
agcctcaggc ttccgcttga gcggatcgcg 
tgcgccgtcc aagtgctggc gcctgtccgc 
tcccgctggg gtggaagggc catggtcgcg 
acctcgctgc tggtcgaagt gttactgctg 
ggatgccggg tggagctttc ccctcagggt 
tggctgctcg ggctcaacaa cctggacggc 
cgtctcgctg atccttcgac gccgccgata 
cgcgtcctag tggtggagga taacgcgatc 
gcgctgggct gcagcgtgga gctgctcttc 
acggcctgct tcgacgtggt gctcaccgat 
ctaaccgcgg agctacggcg ccaagggttc 
gccatgcgtg aggagcgcga gcgctgcatg 
ccggtggatc tgaatgccct tcagaactgc 
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Figure 6H 



atgctgctgg atcagcatgg gcaggtcgta 60 • 
ggtgcgtcgc ggacgctttt gcgtggagac 120 
ccgcagcata tggtgctttt ccagcacgtg 18.0 
atcggtatcg gtcgcctgtt gctggctctg 240 
gcactcgcag tcggcatcct actgcattgg 300 
gagcccgcaa agcgacgcct tgaagcattg 360 
atccaggccg cgcccgtcgc gctgtgcgtg 420 
gaaaatcccc aggcgcgcca atggctgggt 480 
agatggattt cccaggcgtt cgcaggaggt 540 
gaggcagggc tacatcttca tctcaattac 600 
ttgttctgcg ccttcagtga aatcagtgca 660 
gcaaaatccc tggcggatgc tgccaatgaa 720 
catgaaatcc gcacacctct gtacggcatg 780 
gagctgagtc ggcagcaggc cggttaccta 84 0 
ctgcaactga tcagcgatgt gcttgacgta 900 
gagtgcgtgg aattctcccc gctggaattg 960 
gccgcgcagg ccaaggggct gcagttgtat 1020 
atgcgggggg ccgcggcgtc gatccggcag 1080 
aagttcaccg acaatggcta tgtcaacgtc 1140 
tgtgtgatgc tgacctggca ggtcaacgat 1200 
ccgcgtctgt tcgaaccgtt ct'accagata 1260 
ggcctcggct tgtcgatcag ccagcgcctg 1320 
gtcagtgagc tggggttggg cagcagcttt 1380 
atgcaggctg agccgcagga cctagccggg 14-40 
gacctaacgg aatgcctgtg tggctggatc 1500 
acgccgaggt cgctggacga ggcggacgcg 1560 
gagggggcgc cgatgttcga agcatggcca 1620 
gatatggagc cgcaggcaca gggccgcgac 16 80- 
ctgcatcgtg ctctgggcct ggcccatggg 1740 
cggctggctc cgttgcgcaa tctaggtctc 1800 
aaccagttga tcttgaggga ccagatggaa 1860 
gatggtcgcg aggcgttgct gcactgccag 1920 
atcaacatgc cgaacatgaa cggatacgag 1980 
cggcagccga tcatcggcgc gacggcgaac 2040 
tccgccggga tgaacgattg cctggtcaaa 2100 
ttgattaata ttctcaaggt ggatcgatga 2160 
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Figure 61 

ORF1-9 

aihactactqq atcagcatgg gcaggtcgta ctcgctacgg acgcagaggc gctggggagc 60 
ggtgc|tcgc ggacgctt?t gcglggagac ggcttcggtt tcatcggtgc tggcccactg 120 
IcgcagcaL tggtgctttt ccagcacgtg gggtcttcga gctgggatct 3fctatcac 180 
a?cggtatcg gtcgcctgtt gctggctctg tggctccctc tgttacttgc ctctgcgttg 240 
acactcgcag tcggcatcct actgcattgg ctggtgcgga gcatcgagcg acgcttgata 300 
qaqcccgcaa agcgacgcct tgaagcattg aaggagagcg aagccttttc ccgtgcagtt 360 
atccaggccg cgcccgtcgc gctgtgcgtg ctgcgtcgtg ccgacgccgc agtggtcctg 420 
gaaaatcccc aggcgcgcca atggctgggt gatagcgagg cgattgccca cgacgcgccg 460 
aqatggattt cccaggcgtt cgcaggaggt gtgaagtgtt ctggagaaga actggaaacc 540 
oLqcagggc tacatcttca tctcaattac acgcccaccc gctataacgg tgaagacgta 600 
ttqttctgcg ccttcagtga aatcagtgca cgcaagcgga tggaggcgga actggctcgc 660 
qcaaaatccc tggcggatgc tgccaatgaa gccaagacgc tgtttctcgc caccatgagc 720 
catgaaatcc gcacacctct gtacggcatg cttggcacgc ttgagctgct tgggcgtacc 780 
qaqctgagtc ggcagcaggc cggttaccta aaggcaatcc agcattcctc gtcgaccctg 840- ■ 
ctgcaactga tcagcgatgt gcttgacgta tccaagatag aggccggcca actggaccta 900 
qaqtqcgtgg aattctcccc gctggaattg accgaagagg tcgtgcagtc gttcaccggt 960 
gccgcgcagg ccaaggggct g6agttgtat acctgcctct ctgcggagct gccgctgcgc 1020 
atgcgggggg ccgcggcgtc gatccggcag attctcaaca acctgctgag caacgcggtg 1080 
aaqttcaccg acaatggcta tgtcaacgtc cacctgaagg ccagcgtggt cgatgccgaa 1140 
tqtqtgatgc tgacctggca ggtcaacgat accggcatgg ggatcaacgt cgaggatcag 1200 
ccgcgtctgt tcgaaccgtt ctaccagata cgccgctccg agcatccggt- cgcaggcacg 1260 
ggcctcggct tgtcgatcag ccagcgcctg gcgcagctaa tgaatggcag tctgaaactg 1320 
qtcagtgagc tggggttggg cagcagcttt agcctoaggc ttccgcttga gcggatcgcg 1380 
atgcaggctg agccgcagga cctagccggg tgcgccgtcc aagtgctggc gcctgtccgc 1440 
qacctaacgg aatgcctgtg tggctggatc tcccgctggg gtggaagggc catggtcgcg 1500 
acgccgaggt cgctggacga ggcggacgcg acctcgctgc tggtcgaagt gttactgctg 1560 
qaqqgggcgc cgatgttcga agcatggcca ggatgccggg tggagctttc ccctcagggt 1620 
gatatggagc cgcaggcaca gggccgcgac tggctgctcg ggctcaacaa cctggacggc 1680 
ctqcatcgtg ctctgggcct ggcccatggg cgtctcgctg atccttcgac gccgccgata 1740 
cqgctggctc cgttgcgcaa tctaggtctc cgcgtcctag tggtggagga taacgcgatc 1800 
aaccagttga tcttgaggga ccagatggaa gcgctgggct gcagcgtgga gctgctcttc 1860 
qatggtcgcg aggcgttgct gcactgccag acggcctgct tcgacgtggt gctcaccgaf 1920 
atcaacatgc cgaacatgaa cggatacgag ctaaccgcgg agctacggcg ccaagggttc 1980 
cqqcagccga tcatcggcgc gacggcgaac gccatgcgtg aggagcgcga gcgctgcatg 2040 
tccgccggga tgaacgattg cctggtcaaa ccggtggatc tgaatgccct tcagaactgc 2100 
ttgattaata ttctcaaggt ggatcgatga 2130 
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Figure 6J 

ORF1-10 

SEQ ID NO: 17 

atggtgcttt tccagcacgt ggggtcttcg agctgggatc tgatctatca catcggtatc 60 
ggtcgcctgt tgctggctct gtggctccct ctgttacttg cctctgcgtt ggcactcgca 120 
gtcggcatcc tactgcattg gctggtgcgg agcatcgagc gacgcttgat agagcccgca 180 
aagcgacgcc ttgaagcatt gaaggagagc gaagcctttt cccgtgcagt tatccaggcc 240 
gcgcccgtcg cgctgtgcgt gctgcgtcgt gccgacgccg cagtggtcct ggaaaatccc 300 
caggcgcgcc aatggctggg tgatagcgag gcgattgccc acgacgcgcc gagatggatt 360 
tcccaggcgt tcgcaggagg tgtgaagtgt tctggagaag aactggaaac cgaggcaggg 420 
ctacatcttc atctcaatta cacgcccacc cgctataacg gtgaagacgt attgttctgc 480 
gccttcagtg aaatcagtgc acgcaagcgg atggaggcgg aactggctcg cgcaaaatcc 540 
ctggcggatg ctgccaatga agccaagacg ctgtttctcg ccaccatgag ccatgaaatc 600 
cgcacacctc tgtacggcat gcttggcacg cttgagctgc ttgggcgtac cgagctgagt 660 
cggcagcagg ccggttacct aaaggcaatc cagcattcct cgtcgaccct gctgcaactg 720 
atcagcgatg tgcttgacgt atccaagata gaggccggcc aactggacct agagtgcgtg 780 
gaattctccc cgctggaatt gaccgaagag gtcgtgcagt cgttcaccgg tgccgcgcag 840 
gccaaggggc tgcagttgta tacctgcctc tctgcggagc tgccgctgcg catgcggggg 900 
gccgcggcgt cgatccggca gattctcaac aacctgctga gcaacgcggt gaagttcacc 960 
gacaatggct atgtcaacgt ccacctgaag gccagcgtgg tcgatgccga atgtgtgatg 1020 
ctgacctggc aggtcaacga taccggcatg gggatcaacg tcgaggatca gccgcgtctg 1080 
ttcgaaccgt tctaccagat acgccgctcc gagcatccgg tcgcaggcac gggcctcggc 1140 
ttgtcgatca gccagcgcct ggcgcagcta atgaatggca gtctgaaact ggtcagtgag 1200 
ctggggttgg gcagcagctt tagcctcagg cttccgcttg agcggatcgc gatgcaggct 1260 
gagccgcagg acctagccgg gtgcgccgtc caagtgctgg cgcctgtccg cgacctaacg 1320 
gaatgcctgt gtggctggat ctcccgctgg ggtggaaggg ccatggtcgc gacgccgagg 13 80 
tcgctggacg aggcggacgc gacctcgctg ctggtcgaag tgttactgct ggagggggcg 1440 
ccgatgttcg aagcatggcc aggatgccgg gtggagcttt cccctcaggg tgatatggag 15 00 
ccgcaggcac agggccgcga ctggctgctc gggctcaaca acctggacgg cctgcatcgt 1560 
gctctgggcc tggcccatgg gcgtctcgct gatccttcga cgccgccgat acggctggct 1620 
ccgttgcgca atctaggtct ccgcgtccta gtggtggagg ataacgcgat caaccagttg 1680 
atcttgaggg accagatgga agcgctgggc tgcagcgtgg agctgctctt cgatggtcgc 1740 
gaggcgttgc tgcactgcca gacggcctgc ttcgacgtgg tgctcaccga tatcaacatg 1800 
ccgaacatga acggatacga gctaaccgcg gagctacggc gccaagggtt ccggcagccg 1860 
atcatcggcg cgacggcgaa cgccatgcgt gaggagcgcg agcgctgcat gtccgccggg 1920 
atgaacgatt gcctggtcaa accggtggat ctgaatgccc ttcagaactg cttgattaat 1980 
attctcaagg tggatcgatg a 2001 
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ORF1-11 

SEQ ID NO: 18 

atggaggcgg aactggctcg cgcaaaatcc 
ctgtttctcg ccaccatgag ccatgaaatc 
cttgagctgc ttgggcgtac cgagctgagt 
cagcattcct cgtcgaccct gctgcaactg 
gaggccggcc aactggacct agagtgcgtg 
gtcgtgcagt cgttcaccgg tgccgcgcag 
tctgcggagc tgccgctgcg catgcggggg 
aacctgctga gcaacgcggt gaagttcacc 
gccagcgtgg tcgatgccga atgtgtgatg 
gggatcaacg tcgaggatca gccgcgtctg 
gagcatccgg tcgcaggcac gggcctcggc 
atgaatggca gtctgaaact ggtcagtgag 
cttccgcttg agcggatcgc gatgcaggct 
caagtgctgg cgcctgtccg cgacctaacg 
ggtggaaggg ccatggtcgc gacgccgagg 
ctggtcgaag tgttactgct ggagggggcg 
gtggagcttt cccctcaggg tgatatggag 
gggctcaaca acctggacgg cctgcatcgt 
gatccttcga cgccgccgat acggctggct 
gtggtggagg ataacgcgat caaccagttg 
tgcagcgtgg agctgctctt cgatggtcgc 
ttcgacgtgg tgctcaccga tatcaacatg 
gagctacggc gccaagggtt ccggcagccg 
gaggagcgcg agcgctgcat gtccgccggg 
ctgaatgccc ttcagaactg cttgattaat 
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Figure 6K 



ctggcggatg ctgccaatga agccaagacg 60 
cgcacacctc tgtacggcat gcttggcacg 120 
cggcagcagg ccggttacct aaaggcaatc 180 
atcagcgatg tgcttgacgt atccaagata 240 
gaattctccc cgctggaatt gaccgaagag 300 
gccaaggggc tgcagttgta tacctgcctc 360 
gccgcggcgt cgatccggca gattctcaac 420 
gacaatggct atgtcaacgt ccacctgaag 4 80 
ctgacctggc aggtcaacga taccggcatg 540 
ttcgaaccgt tctaccagat acgccgctcc 600 
ttgtcgatca gccagcgcct ggcgcagcta 660 
ctggggttgg gcagcagctt tagcctcagg 720 * 
gagccgcagg acctagccgg gtgcgccgtc 780 
gaatgcctgt gtggctggat ctcccgctgg 840 
tcgctggacg aggcggacgc gacctcgctg 900 
ccgatgttcg aagcatggcc aggatgccgg 960 
ccgcaggcac agggccgcga ctggctgctc 1020 
gctctgggcc tggcccatgg gcgtctcgct 1080 
ccgttgcgca atctaggtct ccgcgtccta 1140 
atcttgaggg accagatgga agcgctgggc 1200 
gaggcgttgc tgcactgcca gacggcctgc 1260 
ccgaacatga acggatacga gctaaccgcg 1320 
atcatcggcg cgacggcgaa cgccatgcgt 1380 
atgaacgatt gcctggtcaa accggtggat 1440 
attctcaagg tggatcgatg a 1491 
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Figure 6L 

ORF1-1 

SEQ ID NO: 19 

Met Lys Leu Lys Asn Phe Leu Gin. Pro Phe Asp Ser Gly Phe Ser Thr 

x 5 10 15 

Pro Ser Ala Ala Leu Lys Leu Leu Arg Met Leu Gly Gly Ala Leu Met 

20 25, 30 

Leu Cys Val Leu Cys Ser Leu He Phe Ser Val Ser Met Val Leu Asn 

35 40 45 

His Gin Val Ser Leu Ser Arg Gin Ala Met Asn Val Ala Met Tyr Glu 

50 55 60 

Ala Gin Leu Tyr Phe Glu Gin Arg Glu Ala Leu Leu Asn His Leu Ser 
65 70 75 80 

Gly Asn Val Val Pro Leu Ala Ala Gly Arg Ala Leu Val Asn Glu Ala 

85 90 95 

Pro Asn Asn Val Ser He Leu Pro Leu Ser Asp Gly Gly Arg Gly Leu 

100 105 HO 

Leu Leu Thr Ala Arg Thr Leu Gly Asp Leu Arg Glu Lys Arg Leu Ala 

115 120 125 

Leu Met Tyr Leu Val Asp Thr Asp Lys Gly Pro Leu Val Tyr Arg Leu 

130 135 140 

Thr Ala Asp Gly Arg Pro Ser Ala Ala He Ser Ser Thr He Thr Lys 
145 150 155 160 

Glu Val Tyr Arg Ala Leu Leu Ala Thr Pro Ser Ala Pro Val His Trp 

165 170 175 

Val Thr Asp Gly Gly Thr Pro Gin Arg Leu Tyr Leu Phe Glu Ser Leu 

180 185 190 

Gly Asp Glu Pro Gly Glu Gly Trp Leu Gly. Leu Glu He Leu Gly Glu 

195 200 205 

Asp Leu Asp Ser Met Leu Arg Arg Asn Asp Ala Gly Asn Tyr Met Leu 

210 215 220 

Leu Asp Gin His Gly Gin Val Val Leu Ala Thr Asp Ala Glu Ala Leu 
225 230 235 240 

Gly Ser Gly Ala Ser Arg Thr Leu Leu Arg Gly Asp Gly Phe Gly Phe 

245 250 255 

He Gly Ala Gly Pro Leu Pro Gin His Met Val Leu Phe Gin His Val 

260 265 270 

Gly Ser Ser Ser Trp Asp Leu He Tyr His He Gly He Gly Arg Leu 

275 280 285 

Leu Leu Ala Leu Trp Leu Pro Leu Leu Leu Ala Ser Ala Leu Ala Leu 

290 295 300 

Ala Val Gly He Leu Leu His Trp Leu Val Arg Ser He Glu Arg Arg 
305 310 315 320 

Leu lie Glu Pro Ala Lys Arg Arg Leu Glu Ala Leu Lys Glu Ser Glu 

325 330 335 

Ala Phe Ser Arg Ala Val He Gin Ala Ala Pro Val" Ala Leu Cys Val 

340 345 350 

Leu Arg Arg Ala Asp Ala Ala Val Val Leu Glu Asn Pro Gin Ala Arg 

355 360 365 

Gin Trp Leu Gly Asp Ser Glu Ala He Ala His Asp Ala Pro Arg Trp 

370 375 380 

He Ser Gin Ala Phe Ala Gly Gly Val Lys Cys Ser Gly Glu Glu Leu 
385 390 395 400 

Glu Thr Glu Ala Gly Leu His Leu His Leu Asn Tyr Thr Pro Thr Arg 

405 410 415 

Tyr Asn Gly Glu Asp Val Leu Phe Cys Ala Phe Ser Glu He Ser Ala 

420 425 430 

Arg Lys Arg Met Glu Ala Glu Leu Ala Arg Ala Lys Ser Leu Ala Asp 

435 440 445 

Ala Ala Asn Glu Ala Lys Thr Leu Phe Leu Ala Thr Met Ser His Glu 
450 455 460 
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Tie Ara Thr Pro Leu Tyr Gly Met Leu Gly Thr Leu Glu Leu Leu Gly 
lis 9 470 475 480 

Ara Thr Glu Leu Ser Arg Gin Gin Ala Gly Tyr Leu Lys Ala He Gin 

485 ~ 490 495 

His Ser Ser Ser Thr Leu Leu Gin Leu He Ser Asp Val Leu Asp Val 

500 505 510 

Ser Lys He Glu Ala Gly Gin Leu Asp Leu Glu Cys Val Glu Phe Ser 

515 520 525 

Pro Leu Glu Leu Thr Glu Glu Val Val Gin Ser Phe Thr Gly Ala Ala 

530 535 540 

Gin Ala Lys Gly Leu Gin Leu Tyr Thr Cys Leu Ser Ala Glu Leu Pro 
545 * 550 555 560 

Leu Arg Met Arg Gly Ala Ala Ala Ser He Arg Gin He Leu Asn Asn 

565 570 575 

Leu Leu Ser Asn Ala Val Lys Phe Thr Asp Asn Gly Tyr Val Asn Val 

580 585 590 

His Leu Lys Ala Ser Val Val Asp Ala Glu Cys Val Met Leu Thr Trp 

595 . 600 605 

Gin Val Asn Asp Thr Gly Met Gly He Asn Val Glu Asp Gin Pro Arg 

610 615 620 

Leu Phe Glu Pro Phe Tyr Gin He Arg Arg Ser Glu His Pro Val Ala 
625 63 0 635 64 0 

Gly Thr Gly Leu Gly Leu Ser He Ser Gin Arg Leu Ala Gin Leu Met 

645 650 655 

Asn Gly Ser Leu Lys Leu Val Ser Glu Leu Gly Leu Gly Ser Ser Phe 

660 665 670 

Ser Leu Arg Leu Pro Leu Glu Arg He Ala Met Gin Ala Glu Pro Gin 

675 680 685 

Asp Leu Ala Gly Cys Ala Val Gin Val Leu Ala Pro Val Arg Asp Leu 

690 695 700 

Thr Glu Cys Leu Cys Gly Trp He Ser Arg Trp Gly Gly Arg Ala Met 
705 710 715 720 

Val Ala Thr Pro Arg Ser Leu Asp Glu Ala Asp Ala Thr Ser Leu Leu 

725 730 735 

Val Glu Val Leu Leu Leu Glu Gly Ala Pro Met Phe Glu Ala Trp Pro 

740 745 750 

Gly cys Arg Val Glu Leu Ser Pro Gin Gly Asp Met Glu Pro Gin Ala 

755 760 765 

Gin Gly Arg Asp Trp Leu Leu Gly Leu Asn Asn Leu Asp Gly Leu His 

770 775 780 

Arg Ala Leu Gly Leu Ala His Gly Arg Leu Ala Asp Pro Ser Thr Pro 
785 . 790 795 800 

Pro He Arg Leu Ala Pro Leu Arg Asn Leu Gly Leu Arg Val Leu Val 

805 810 815 

Val Glu Asp Asn Ala He Asn Gin Leu He Leu Arg Asp Gin Met Glu 

820 825 830 

Ala Leu Gly Cys Ser Val Glu Leu Leu Phe Asp Gly. Arg Glu Ala Leu 

835 840 845 

Leu His Cys Gin Thr Ala Cys Phe Asp Val Val Leu Thr Asp He Asn 

850 855 860 

Met Pro Asn Met Asn Gly Tyr Glu Leu Thr Ala Glu Leu Arg Arg Gin 
865 870 875 880 

Gly Phe Arg Gin Pro He He Gly Ala Thr Ala Asn Ala Met Arg Glu 

885 890 895 

Glu Arg Glu Arg Cys Met Ser Ala Gly Met Asn Asp Cys Leu Val Lys 

900 905 910 

Pro Val Asp Leu Asn Ala Leu Gin Asn Cys Leu He Asn He Leu Lys 

915 920 925 

Val Asp Arg 
930 
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Figure 6M 

ORF1-2 

SEQ ID NO:20 

Met Leu Gly Gly Ala Leu Met Leu Cys Val Leu Cys Ser Leu lie Phe 

! 5 10 15 

Ser Val Ser Met Val Leu Asn His Gin Val Ser Leu Ser Arg Gin Ala 

20 25 30. 

Met Asn Val Ala Met Tyr Glu Ala Gin Leu Tyr Phe Glu Gin Arg Glu 

"'35 40 45 

Ala Leu Leu Asn His Leu Ser Gly Asn Val Val Pro Leu Ala Ala Gly 

50 55 60 

Arg Ala Leu Val Asn Glu Ala Pro Asn Asn Val Ser lie Leu Pro Leu 
65 70. 75 80 

Ser Asp Gly Gly Arg Gly Leu Leu Leu Thr Ala Arg Thr Leu Gly Asp 

.85 90 95 

Leu Arg Glu Lys Arg Leu Ala Leu Met Tyr Leu Val Asp Thr Asp Lys 

100 105 110 

Gly Pro Leu Val Tyr Arg Leu Thr Ala Asp Gly Arg Pro Ser Ala Ala 

115 120 125 

He Ser Ser Thr He Thr Lys Glu Val Tyr Arg Ala Leu Leu Ala Thr 

130 135 140 

Pro Ser Ala Pro Val His Trp Val Thr Asp Gly Gly Thr Pro Gin Arg 
145 150 155 160 

Leu Tyr Leu Phe Glu Ser Leu Gly Asp Glu Pro Gly Glu Gly Trp Leu 

165 170 175 

Gly Leu Glu He Leu Gly Glu Asp Leu Asp Ser Met Leu Arg Arg Asn 

180 185 190 

Asp Ala Gly Asn Tyr Met Leu Leu Asp Gin His Gly Gin Val Val Leu 

195 200 205 

Ala Thr Asp Ala Glu Ala Leu Gly Ser Gly Ala Ser Arg Thr Leu Leu 

210 215 220 

Arg Gly Asp Gly Phe Gly Phe He Gly Ala Gly Pro Leu Pro Gin His 
225 230 235 240 

Met Val Leu Phe Gin His Val Gly Ser Ser Ser Trp Asp Leu He Tyr 

245 250 255. 

His He Gly He Gly Arg Leu Leu Leu Ala Leu Trp Leu Pro Leu Leu 

260 265 270 

Leu Ala Ser Ala Leu Ala Leu Ala Val Gly He Leu Leu His Trp Leu 

275 280 y 285 

Val Arg Ser He Glu Arg Arg Leu lie Glu Pro Ala Lys Arg Arg Leu 

290 295 300 

Glu Ala Leu Lys Glu Ser Glu Ala Phe Ser Arg Ala Val He Gin Ala 
305 310 315 320 

Ala Pro Val Ala Leu Cys Val Leu Arg Arg Ala Asp Ala Ala Val Val 

325 330 335 

Leu Glu Asn Pro Gin Ala Arg Gin Trp Leu Gly Asp Ser Glu Ala lie 

340 345 350 

Ala His Asp Ala Pro Arg Trp He Ser. Gin Ala Phe Ala Gly Gly Val 

355 360 365 

Lys Cys Ser Gly Glu Glu Leu Glu Thr Glu Ala Gly Leu His Leu His 

370 375 380 

Leu Asn Tyr Thr Pro Thr Arg Tyr Asn Gly Glu Asp Val Leu Phe Cys 
385 390 395 400 

Ala Phe Ser Glu He Ser Ala Arg Lys Arg Met Glu Ala Glu Leu Ala 

' 405 410 415 

Arg Ala Lys Ser Leu Ala Asp Ala Ala Asn Glu Ala Lys Thr Leu Phe 

420 425 430 

Leu Ala Thr Met Ser His Glu He Arg Thr Pro Leu Tyr Gly Met Leu 

435 440 445 

Gly Thr Leu Glu Leu Leu Gly Arg Thr Glu Leu Ser Arg Gin Gin Ala 
450 455 " 460 
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Gly 
465 


Tyr 


Leu 


Lys 


Aia 


Tip 

470 


uin 


ti4 e cpr Ser Ser Thr Leu Leu Gin Leu 

475 480 


He 


Ser 


Asp 


Val 


Leu 


Asp 


Ur»l 

vai 


Gov t.-v/q TTa Glu Ala Glv Gin Leu AsO 

ocl Jj Y a lie uiu /%jici \j x y vJ.ii j_jc^u no^* 






485 








Leu 


Glu 


Cys 


val 


blU 


irne 


Ser 


D-ro t.oii Gl\i Tien Thr Glu Glu Val Val 




500 








■ 505 .510 


Gin 


Ser 


Phe 


Thr 


Gly 


Ala 


Ala 


tjin Aia iiys \j±y ijcu uaij ucu 




515 








520 525 


Cys 


Leu 


Ser 


Ala 


Glu 


Leu 


Pro 


Leu Arg Met Arg biy Aia Aia Aia oer 


530 










535 


540 


He 


Arg 


Gin 


He 


Leu 


Asn 


Asn 


LieU IteU oer noil H-La Vai Uyb .file lili 


545 








550 




555 ooO 


Asp 


Asn 


Gly 


Tyr 


Val 


Asn 


Val 


r t ' _ T i rm 7\ 1 « o Q v" XTa 1 17a 1 2\ctv Bin 

His Leu Lys Aia oer vai vai Asp 






565 




* 


570 5/o 


Glu 


Cys 


Val 


Met 


Leu 


Thr 


Trp 


Gin vai Asn Asp inr uiy net. yxy j.j.c 




580 








585 590 


Asn 


Val 


Glu 


Asp 


Gin 


Pro 


Arg 


Leu Phe Glu Pro Pne Tyr Gin lie Arg 




'595 








600 605 


Arg 


Ser 


Glu 


His 


Pro 


val 


Ala 


Gly Thr Gly Leu Gly Leu Ser He Ser 


610 










615 


620 


Gin 


Arg 


Leu 


Ala 


Gin 


Leu 


Met 


Asn Gly oer Leu Lys i>eu vai oer uiu 


625 








630 




635 640 


Leu 


Gly 


Leu 


Gly 


Ser 


Ser 


Phe 


Ser Leu Arg Leu Pro Leu Glu Arg He 








645 






650 655 


Ala 


Met 


Gin 


Ala 


Glu 


Pro 


Gin 


Asp Leu Ala Gly Cys Ala Val Gin Val 






660 








665 670 


Leu 


Ala 


Pro 
675 


Val 


Arg 


Asp 


Leu 


Thr Glu Cys Leu Cys Gly Trp He Ser 
680 685 


Arg 


Trp 


Gly 


Gly 


Arg 


Ala 


Met 


Val Ala Thr Pro Arg Ser Leu Asp Glu 


690 








695 


700 


Ala 


Asp 


Ala 


Thr 


Ser 


Leu 


Leu 


Val Glu Val Leu Leu Leu Glu Gly Ala 


705 








710 




715 720 


Pro 


Met 


Phe 


Glu 


Ala 
725 


Trp 


Pro 


Gly Cys Arg Val Glu Leu Ser Pro Gin 
730 735 


Gly 


Asp 


Met 


Glu 


Pro 


Gin 


Ala 


Gin Gly Arg Asp Trp Leu Leu Gly Leu 




740 








745 * 750 


Asn 


Asn 


Leu 


Asp 


Gly 


Leu 


His 


Arg Ala Leu Gly Leu Ala His Gly Arg 






755 








760 765 


Leu 


Ala 

770 


Asp 


Pro 


Ser 


Thr 


Pro 
775 


Pro He Arg Leu Ala Pro Leu Arg Asn 
. 780 


Leu 


Gly 


Leu 


Arg 


Val 


Leu 


Val 


Val Glu Asp Asn Ala He Asn Gin Leu 


785 








790 




795 800 


He 


Leu 


Arg 


Asp 


Gin 
805 


Met 


Glu 


Ala Leu Gly Cys Ser Val Glu Leu Leu 
810 815 


Phe 


Asp 


Gly 


Arg 


Glu 


Ala 


Leu 


Leu His Cys Gin Thr Ala Cys Phe Asp 




820 








825 830 


Val 


Val 


Leu 


Thr 


Asp 


He 


Asn 


Met Pro Asn Met Asn Gly Tyr Glu Leu 






835 








840 845 


Thr 


Ala 
850 


Glu 


Leu 


Arg 


Arg 


Gin 
855 


Gly Phe Arg Gin Pro He He Gly Ala 
860 


Thr 


Ala 


Asn 


Ala 


Met 


Arg 


Glu 


Glu Arg Glu Arg Cys Met Ser Ala Gly 


865 










870 




875 880 


Met 


Asn 


Asp 


Cys 


Leu 


Val 


Lys 


Pro Val Asp Leu Asn Ala Leu Gin Asn 








685 






890 895 


Cys 


Leu 


He 


Asn 


He 


Leu 


Lys 


Val Asp Arg 






900 








905 
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Figure' 6N 

SEQ ID NO: 21 
ORF1-3 

Met Leu Cys Val Leu Cys Ser Leu He Phe Ser Val Ser Met Val Leu 

1 5 10 15 

Asn His Gin Val Ser Leu Ser Arg Gin Ala Met Asn Val .Ala Met Tyr 

20 25 30 

Glu Ala Gin Leu Tyr Phe Glu Gin Arg Glu Ala Leu Leu Asn His Leu 

35 40 45 

Ser Gly Asn Val Val Pro Leu Ala Ala Gly Arg Ala Leu Val Asn Glu 

50 55 60 

Ala Pro Asn Asn Val Ser He Leu Pro Leu Ser Asp Gly Gly Arg Gly 
65 70 75 80 

Leu Leu Leu Thr Ala Arg Thr Leu Gly Asp Leu Arg Glu Lys Arg Leu 

85 90 95 

Ala Leu Met Tyr Leu Val Asp Thr Asp Lys Gly Pro Leu Val Tyr Arg 

100 105 110 • 

Leu Thr Ala Asp Gly Arg Pro Ser Ala Ala He Ser Ser Thr He Thr 

115 " ' 120 125 

Lys Glu Val Tyr Arg Ala Leu Leu Ala Thr Pro Ser Ala Pro Val His 

130 " 135 140 

Trp Val Thr Asp Gly Gly Thr Pro Gin Arg Leu Tyr Leu Phe Glu Ser 
145 150 155 160 

Leu Gly Asp Glu Pro Gly Glu Gly Trp Leu Gly Leu Glu He Leu Gly 

165 170 175 

Glu Asp Leu Asp Ser Met Leu Arg Arg Asn Asp Ala Gly Asn Tyr Met 

180 185 190 

Leu Leu Asp Gin His Gly Gin Val Val Leu Ala Thr Asp Ala Glu Ala 

195 200 205 

Leu Gly Ser Gly Ala Ser Arg Thr Leu Leu Arg Gly Asp Gly Phe Gly 

210 215 220 

Phe He Gly Ala Gly Pro Leu Pro Gin His Met Val Leu Phe Gin His 
225 230 235 240 

Val Gly Ser Ser Ser Trp Asp Leu He Tyr His He Gly He Gly Arg 

245 250 255 

Leu Leu Leu Ala Leu Trp Leu Pro Leu Leu lieu Ala Ser Ala Leu Ala 

260 " 265 270 

Leu Ala Val Gly He Leu Leu His Trp Leu Val Arg Ser He Glu Arg 

275 " 280 285 

Arg Leu He Glu Pro Ala Lys Arg Arg Leu Glu Ala Leu Lys Glu Ser 

290 295 300 

Glu Ala Phe Ser Arg Ala Val He Gin Ala Ala Pro Val Ala Leu Cys 
305 310 315 320 

Val Leu Arg Arg Ala Asp Ala Ala Val Val Leu Glu Asn Pro Gin Ala 

325 330 . 335 

Arg Gin Trp Leu Gly Asp Ser Glu Ala He Ala His Asp Ala Pro Arg 

340 345 350 

Trp lie Ser Gin Ala Phe Ala Gly Gly Val Lys Cys Ser Gly Glu Glu 

355 360 365 

Leu Glu Thr Glu Ala Gly Leu His Leu His Leu Asn Tyr Thr Pro Thr 

370 375 380 

Arg Tyr Asn Gly Glu Asp Val Leu Phe Cys Ala Phe Ser Glu He Ser 
385 - 390 395 400 

Ala Arg Lys Arg Met Glu Ala Glu Leu Ala Arg Ala Lys Ser Leu Ala 

4 05 410 415 

Asp Ala Ala Asn Glu Ala Lys Thr Leu Phe Leu Ala Thr Met Ser His 

420 425 430 

Glu He Arg Thr Pro Leu. Tyr Gly Met Leu Gly Thr Leu Glu Leu Leu 

435 440 445 

Gly Arg Thr Glu Leu Ser Arg Gin Gin Ala Gly Tyr Leu Lys Ala lie 
450 455 460 
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Gin His Ser Ser Ser Thr Leu Leu Gin Leu He Ser Asp 

465 470 475 

Val Ser Lys He Glu Ala Gly Gin Leu Asp Leu Glu Cys 

485 490 
Ser Pro Leu Glu Leu Thr Glu Glu Val Val Gin Ser Phe 

500 505 
Ala Gin Ala Lys Gly Leu Gin Leu Tyr Thr Cys Leu Ser 
515 520 " 525 

Pro Leu Arg Met Arg Gly Ala Ala Ala Ser He Arg Gin 

530 535 540 

Asn Leu Leu Ser Asn Ala Val Lys Phe Thr Asp Asn Gly 
545 550 555 

Val His Leu Lys Ala Ser Val Val Asp Ala Glu Cys Val 

565 570 
Trp Gin Val Asn Asp Thr Gly Met Gly He Asn Val Glu 

580 585 
Arg Leu Phe Glu Pro Phe Tyr Gin He Arg Arg Ser Glu 
595 600 " 605 

Ala Gly Thr Gly Leu Gly Leu Ser He Ser Gin Arg Leu 

610 615 620 

Met Asn Gly Ser Leu Lys Leu Val Ser Glu Leu Gly Leu 
625 630 635. 

Phe Ser Leu Arg Leu Pro Leu Glu Arg He Ala Met Gin 

645 650 
Gin Asp Leu Ala Gly Cys Ala Val Gin Val Leu Ala Pro 

660 665 
Leu Thr Glu Cys Leu Cys Gly Trp lie Ser Arg Trp Gly 
675 680 685 

Met Val Ala Thr Pro Arg Ser Leu Asp Glu Ala Asp Ala 

690 695 700 

Leu Val Glu Val Leu Leu Leu Glu Gly Ala Pro Met Phe 
705 710 715 

Pro Gly Cys Arg Val Glu Leu Ser Pro Gin Gly Asp Met 

725 730 
Ala Gin Gly Arg Asp Trp Leu Leu Gly Leu Asn Asn Leu 

740 745 
His Arg Ala Leu Gly Leu Ala His Gly Arg Leu Ala Asp 
755 760 ~ 765 

Pro Pro He Arg Leu Ala Pro Leu Arg Asn Leu Gly Leu 

770 775 780 

Val Val Glu Asp Asn Ala lie Asn Gin Leu He Leu Arg 
785 790 795 

Glu Ala Leu Gly Cys Ser Val Glu Leu Leu Phe Asp Gly 

805 810 
Leu Leu His Cys Gin Thr Ala Cys Phe Asp Val Val Leu 

820 825 
Asn Met Pro Asn Met Asn Gly Tyr Glu Leu Thr Ala Glu 
835 840 845 

Gin Gly Phe Arg Gin Pro He He Gly Ala Thr Ala Asn 

850 855 860 

Glu Glu Arg Glu Arg Cys Met Ser Ala Gly Met Asn Asp 
865 870 875 

Lys Pro Val Asp Leu Asn Ala Leu Gin Asn Cys Leu lie 
885 890 

Lys Val Asp Arg 
900 



Val Leu Asp 
480 

Val Glu Phe 

495 
Thr Gly Ala 
510 

Ala Glu Leu 

lie Leu Asn 

Tyr Val Asn 
560 

Met Leu Thr 
575 

Asp Gin Pro' 
590 

His Pro Val 

Ala Gin Leu 

Gly Ser Ser 
640 

Ala Glu Pro 

655 
Val Arg Asp 
670 

Gly Arg Ala 

Thr Ser Leu 

Glu Ala Trp 
720 

Glu Pro Gin 

735 
Asp Gly Leu 
750 

Pro Ser Thr 

Arg Val Leu 

Asp Gin Met 
800 

Arg Glu Ala 

815 
Thr Asp lie 
830 

Leu Arg Arg 

Ala Met Arg 

Cys Leu Val 
880 

Asn He Leu 
895 
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Figure 60 

ORF1-4 

SEQ ID NO: 22 

Met Val Leu Asn His Gin Val Ser Leu Ser Arg Gin Ala Met Asn Val 

1 5 10 15 

Ala Met Tyr Glu Ala Gin Leu Tyr Phe Glu Gin Arg Glu Ala Leu Leu 

20 25 30 

Asn His Leu Ser Gly Asn Val Val Pro Leu Ala Ala Gly Arg Ala Leu 

35 40 45 

Val Asn Glu Ala Pro Asn Asn Val Ser He Leu Pro Leu Ser Asp Gly 

50 55 60 

Gly Arg Gly Leu Leu Leu Thr Ala Arg Thr Leu Gly Asp Leu Arg Glu 
65 ~ 70 75 80 

Lys Arg Leu Ala Leu Met Tyr Leu Val Asp .Thr Asp Lys Gly Pro Leu 

85 90 95 . 

Val Tyr Arg Leu Thr Ala Asp Gly Arg Pro Ser Ala Ala He Ser Ser 

100 105 HO 

Thr He Thr Lys Glu Val Tyr Arg Ala Leu Leu Ala Thr Pro Ser Ala 

.115 120 125 

Pro Val His Trp Val Thr Asp Gly Gly Thr Pro Gin Arg Leu Tyr Leu 

130 135 140 

Phe Glu Ser Leu Gly Asp Glu Pro Gly Glu Gly Trp Leu Gly Leu Glu 
145 150 155 160 

He Leu Gly Glu Asp Leu Asp Ser Met Leu Arg Arg Asn Asp Ala Gly 

165 170 175 

Asn Tyr Met Leu Leu Asp Gin His Gly Gin Val Val Leu Ala Thr Asp 

180 185 190 

Ala Glu Ala Leu Gly Ser Gly Ala Ser Arg *Thr Leu Leu Arg Gly Asp 

195 200 205 

Gly Phe Gly Phe He Gly Ala Gly Pro Leu Pro Gin His Met Val Leu 

210 215 220 

Phe Gin His Val Gly Ser Ser Ser Trp Asp Leu He Tyr His He Gly 
225 230 235 240 

He Gly Arg Leu Leu Leu Ala Leu Trp Leu Pro Leu Leu Leu Ala Ser 

245 250 255 

Ala Leu Ala Leu Ala Val Gly He Leu Leu His Trp Leu Val Arg Ser 

260 265 270 

He Glu Arg Arg Leu He Glu Pro Ala Lys Arg Arg Leu Glu Ala Leu 

275 280 285 

Lys Glu Ser Glu Ala Phe Ser Arg Ala Val He Gin Ala Ala Pro Val 

290 295 300 

Ala Leu Cys Val Leu Arg Arg Ala Asp Ala Ala Val Val Leu Glu Asn 
305 310 315 320 

Pro Gin Ala Arg Gin Trp Lea Gly Asp Ser Glu Ala He Ala His Asp 

325 330 335 

Ala Pro Arg Trp He Ser Gin Ala Phe Ala Gly Gly Val Lys Cys Ser 

340 345 350 

Gly Glu Glu Leu Glu Thr Glu Ala Gly Leu His Leu His Leu Asn Tyr 

355 360 365 

Thr Pro Thr Arg Tyr Asn Gly Glu Asp Val Leu Phe Cys Ala Phe Ser 

370 375 380 

Glu He Ser Ala Arg Lys Arg Met Glu Ala Glu Leu Ala Arg Ala Lys 
385 390 395 400 

Ser Leu Ala Asp Ala Ala Asn Glu Ala Lys Thr Leu Phe Leu Ala Thr 

405 410 415 

Met Ser His Glu He Arg Thr Pro Leu Tyr Gly Met Leu Gly Thr Leu 

420 425 430 

Glu Leu Leu Gly Arg Thr Glu Leu Ser Arg' Gin Gin Ala Gly Tyr Leu 

435 440 445 

Lys Ala He Gin His Ser Ser Ser Thr Leu Leu Gin Leu He Ser Asp 
450 455 460 
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Val 


Leu Asp Val Ser Lys 


lie 


Glu 


Ala 


Gly Gin Leu Asp Leu Glu Cys 


465 


470 








475 480 


Val 


Glu Phe Ser Pro Leu 


Glu 


Leu 


Thr 


Glu Glu Val Val Gin Ser Phe 




4B5 








490 495 


Thr 


Gly Ala Ala Gin Ala 


Lys 


Gly 


Leu 


Gin Leu Tyr Thr Cys Leu Ser 




500 






505 


510 


Ala 


Glu Leu Pro Leu Arg 


Met 


Arg 


Gly 


Ala Ala Ala Ser He Arg Gin 




515 




520 




525 


lie 

•lie 


Leu Asn Asn Leu Leu 


Ser 


Asn 


Ala 


Val Lys Phe Thr Asp Asn Gly 




530 


535 






540 


±yx 


Val Asn Val His Leu 


Lvs 


Ala 


Ser 


Val Val Asp Ala Glu Cys Val 




550 








555 560 


Mpf 

ric l. 


Leu Thr Tm Gin Val 


Asn 


Asn 


Thr 


Glv Met Glv He Asn Val Glu 




565 








570 575 




Gin Pro Ara Leu Phe 


Glu 


Pro 


Phe 


Tvr Gin He Ara Ara Ser Glu 


580 






585 


590 


His 


Pro Val Ala Gly Thr 


Glv 


Leu 


Glv 


Leu Ser He Ser Gin Ara Leu 








600 




605 


Al a 


Gin Leu Met Asn Glv 


Ser 


Leu 


Lvs 


Leu Val Ser Glu Leu Glv Leu 






615 

O X Zj 










Car Qpr PVlP fipr TiPU 

OCX OCX i ilC uCJ. LICU 


Arcr 


Leu 


Prn 


T»pii Gin Aro Tip 2X1 a Mpr* Gin 

XJCU VJ X U, niy XXC nla I'ICL V3J.il 


/roc 


O J w 








U J3 Olv 


Al a 


nin Pro Gin As"D Leu 


Ala 


Glv 

\jxy 


Cvs 
y 


Ala Val Gin Val Leu Ala Pro 




645 








650 655 


vai 


am fisD Leu Thr Glu 


Pvs 
v»y a 


Leu 


Cvs 
L.y d 


Glv Trn Tip ^pr Ar*o Tm Olv 
wij x x wy xxc ocx nxy xxlv \jj~y 




660 






O D D 




Gly 


Arg Ala Met Val Ala 


Thr 


Pro 


Arg 


Ser Leu Asn Glu A"l a Asn Al a 




O / 3 




O O V 




O D j 


T"h>- 
■LllX 


Cot* TiPII T«P11 Val Gl 11 
OCX XJCU ucu vox uxu 


Val 


Leu 


Leu 


T .on Gin Glv Si a Prr*i Mot~ DVia 
XJCU V7X U wXy nlO rlU NCL JTiiC 






foe 






7n ft 


Glu 


nl a T'rn Prn fil v 


Arg 


Val 


Glu 


T.pii Cpy T)rr» Gin Gl \f Son Mpf* 
XJCU OCX riU V3XI1 vjxy nop ncL. 


H f\ C 


/ lu 










Glu 


Drn flln Ala Gin Glv 




Asp 


XX p 


Tipn TiPii Glv TiPii ZXqti Ann TiPii 
XJCU XJCU \jxy XJCU nsu no IX XJCU 




l«3 








Tin 7 *i t; 

/Ju /JO 


Asp 


Gl \/ TiPii TTt k Arcr TXl a 
OXy xjcu nj-b niy nio 


XJC u 


vaxy 


XJC Li 


Al a U i c f2l \r &YTI T , o 1 1 Til a lier^ 

jhxo xixfa vaxy i-ix y ucu .ma Mop 


7Z. n 






7/1 5 


7 R ft 


riO 


ear Thr Prn Prn T 1 p 
ocx a iix nu r i u xx c 


SrO 


XJC u 


Al a 


Pro T l Arrr fiori T .pii /3l%r T.on 
ri u xjcu niy jHoii xjcu Aaxy xjcu 




/33 








/ OD 


Ar 9 


Val Leu Val Val Glu 

val xjcu vox v a a w<lu 


Asp 


fieri 
noil 


Ala 


Tl <3 Sen Gl n T»^il Tl ^ TiDii Rrn 

xxc jnoix win xjcu xxc xjcu rixy 


•77A 


775 
/ / o 






7Rfl 


rtft lj 


Gin Met Glu Ala Leu 


Glv 

wxy 


Cvs 


Ser 


Val Glu Leu Lpu Php Asn Glv 

VOX U1U JJv U XJCU r 11C AO^ . wxy 


7 J5 C 


790 








7 Q CI con 


Urn 


Glu Ala Leu Leu His 


Cvs 


Gin 


Thr 


Ala Pvs Php Asn Val Val TiPii 
s\± a v,y d jtxjc no vctx vcix xjcu 


805 








ft! 0 PI c; 


Thr 


Asp He Asn Met Pro 


Asn 


Met 


Asn 


Glv Tvr Glu Leu Thr Ala Glu 




82 0 






825 


' 830 


Leu 


Arg Arg Gin Gly Phe 


Arg 


Gin 


Pro 


He He Gly Ala Thr Ala Asn 




835 




840 




845 


Ala 


Met Arg Glu Glu Arg 


Glu 


Arg 


Cys 


Met Ser Ala Gly Met Asn Asp 




850 


855 






860 


Cys 


Leu Val Lys Pro Val 


Asp 


Leu 


Asn 


Ala Leu Gin Asn Cys Leu He 


865 


870 








875 880 


Asn 


He Leu Lys Val Asp 


Arg 









885 
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Figure 6P 

ORF1-5 

SEQ ID NO: 23 

Met Asn Val Ala Met Tyr Glu Ala Gin Leu Tyr Phe Glu Gin Arg Glu 

1 5 10 15 

Ala Leu Leu Asn His Leu Ser Gly Asn Val Val Pro Leu Ala Ala Gly 

20 25 30 

Arg Ala Leu Val Asn Glu Ala Pro Asn Asn Val Ser lie Leu Pro Leu 

35 40 45 

Ser Asp Gly Gly Arg Gly Leu Leu Leu Thr Ala Arg Thr Leu Gly Asp 

50 55 60 

Leu Arg Glu Lys Arg Leu Ala Leu Met Tyr Leu Val Asp Thr Asp Lys 
65 70 75 80 

Gly Pro Leu Val Tyr Arg -Leu Thr Ala Asp Gly Arg Pro Ser Ala Ala 

85 . 90 95 

lie Ser Ser Thr lie Thr Lys Glu Val Tyr Arg Ala Leu Leu Ala Thr 

100 105 110 

Pro Ser Ala Pro Val His Trp Val Thr Asp Gly Gly Thr Pro Gin Arg 

115 120 125 

Leu Tyr Leu Phe Glu Ser Leu Gly Asp Glu Pro Gly Glu Gly Trp Leu 

130 135 140 

Gly Leu Glu lie Leu Gly Glu Asp Leu Asp Ser Met Leu Arg Arg Asn 
145 150 155 160 

Asp Ala Gly Asn Tyr Met Leu Leu Asp Gin His Gly Gin Val Val Leu 

165 170 175 

Ala Thr Asp Ala Glu Ala Leu Gly Ser Gly Ala Ser Arg Thr Leu Leu 

180 185 190 

Arg Gly Asp Gly Phe Gly Phe He Gly Ala Gly Pro Leu Pro Gin His 

195 200 ' 205 

Met Val Leu Phe Gin His Val Gly Ser Ser Ser Trp Asp Leu He Tyr 

210 215 220 

His He Gly He Gly Arg Leu Leu Leu Ala Leu Trp Leu Pro Leu Leu 
225 230 235 240 

Leu Ala Ser Ala Leu Ala Leu Ala Val Gly He Leu Leu His Trp Leu 

245 250 255 

Val Arg Ser He Glu Arg Arg Leu He Glu Pro Ala Lys Arg Arg Leu 

260 265 270 

Glu Ala Leu Lys Glu Ser Glu Ala Phe Ser Arg Ala Val He Gin Ala 

275 280 285 

Ala Pro Val Ala Leu Cys Val Leu Arg Arg Ala Asp Ala Ala Val Val 

290 295 300 

Leu Glu Asn Pro Gin Ala Arg Gin Trp Leu Gly Asp Ser Glu Ala He 
305 310 315 320 

Ala His Asp Ala Pro Arg Trp He Ser Gin Ala Phe Ala Gly Gly Val 

325 330 335 

Lys Cys Ser Gly Glu Glu Leu Glu Thr Glu Ala Gly Leu His Leu His 

340 345 350 

Leu Asn Tyr Thr Pro Thr Arg Tyr Asn Gly Glu Asp Val Leu Phe Cys 

355 360 365 

Ala Phe Ser Glu He Ser Ala Arg Lys Arg Met Glu Ala Glu Leu Ala 

370 375 380 

Arg Ala Lys Ser Leu Ala Asp Ala Ala Asn Glu Ala Lys Thr Leu Phe 
385 390 395 400 

Leu Ala Thr Met Ser His Glu He Arg Thr Pro Leu Tyr Gly Met Leu 

405 410 415 . 

Gly Thr Leu Glu Leu Leu Gly Arg Thr Glu Leu Ser Arg Gin Gin Ala 

420 425 430 

Gly Tyr Leu Lys Ala He Gin His Ser Ser Ser Thr Leu Leu Gin Leu 

435 440 445 

He Ser Asp Val Leu Asp Val Ser Lys He Glu Ala Gly Gin Leu Asp 
450 455 460 
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Leu 


Glu 


Cvs 


Val 


Glu 


Phe 


Ser 


Pro Leu Glu Leu Thr Glu Glu Val Val 


4 ob 








470 




475 480 




Ser 


Phe 

rut 


Thr 


Glv 


Ala 


Ala 


Gin Ala Lys Gly Leu Gin Leu Tyr Thr 








4 85 






490 495 


Cys 


UCU 


Car 
O Ci 


Ala 


■Glu 


Leu 


Pro 


Leu Arg Met Arg Gly Ala Ala Ala Ser 














505 510 


He 


Arg 


Gin 

13111 


Tie 


Leu 


Asn 


Asn 


Leu Leu Ser Asn Ala Val Lys Phe Thr 




CI C 










520 525 


Asp 


Asn 




Tvt* 


vox 


Asn 


Val 


His Leu Lys Ala Ser Val Val Asp Ala 


b J U 










535 


540 


olu 


Cys 


Val 
Val 


Met 


UCU 


Thr 




Gin Val Asn Asp Thr Gly Met Gly He 


r / r 

54b 








-J O V 




555 560 


Asn 


vaj. 


r»l n 


Asp 


Gl n 
bill 






Leu Phe Glu Pro Phe Tyr Gin He Arg 








SOS 






570 575 


Arg 


Ser 


fll ii 


XJ4.& 


Jr X. U 


Val 


Ala 


Gly Thr Gly Leu Gly Leu Ser lie Ser 






con 








RftR 590 


Gin 


Arg 




Al a 

M J. d 


Gl n 


Leu 


Met 


Asn Gly Ser Leu Lys Leu Val Ser Glu 




595 










end 605 


Leu 


Gly 


Leu 


uiy 


Ser 


Coy* 
OCX 




cpr Tipii Area Leu Pro Leu Glu Ara He 




610 










old 


620 


Ala 


Met 




Mia 


ulU 


Pro 


m n 

Ul 11 


Ann Lipii Ala Glv Cvs Ala Val Gin Val 


625 














fi!5 640 


Leu 


Ala 


Pro 


Val 


Arg 


Asp 


Leu 


Thr Gin Pvfs Tjeu Pvfi Glv Tim lie Ser 

1111 Wi.U VJT D UC LI \*yB wAJf A A J. B WCl 










645 






£t;n fi55 


Arg 


Trp 


Gly 


Giy 


Arg 


Mia 


Mat- 
net 


Val 2X1 a TViy* Pro Aro Ret* Tjeu Asf) Glu 

val 1111 It l w >iiy OCX JJCU nD^l v<LU 




660 








DC j D/U 


Ala 


Asp 


Ala 


Thr 


Ser 


Leu 


Leu 


Val f2ln 17a 1 T.^n T.e»n Tif^ii Gl 11 Glv Ala 
Val ulu Val JjcU lieu ucu V3iu uiy nia 




675 












Pro 


Met 


Pne 


pi,. 

UiU 


A J. a 


Trp 


Pro 


v C\rc ~h.yn Val d"\ ii T.^u Cpr Prn " fil n 
uiy Ly b ni y val uiu DC u oci tr±\j viu 




690 










695 


•7 n n 


Gly 


Asp 


Met 


u±U 


Pro 


o in 


21 1 a 
Ala 


f2"l n fil v 7\ -yn H en Tm TiPii TiPii Glv Tjf»"n 
olil uiy Ml y nop lip ucu ucu \j jl y ucu 


705 








7 10 




•7 1. 1 7 o n 


Asn 


Asn 


Leu 


Asp 




Leu 


ill o 


1\y ri Zil a T^eii Glv T.*»ii ATa "Hi ^ Glv At* a 
niy Mia ucu v3 1 _y ucu nJ-o nio uiy rviy 










725 






■ 71 n 77R 


Leu 


Ala 


Asp 


pro 


Ser 


mr 


Pro 


rlO lie /iig iicU /ild riU ucu niy Moll 






740 








745 /3U 


Leu 


Gly 


Leu 


Arg 


Val 


Leu 


Val 


Val wlu Mouy Moil Mia lie Moll will ucu 




755 


■ 








/Oil /OS 


He 


Leu 


Arg 


Asp 


uin 


ncL 


f2l ii 


Ala Ti^n Gl \/ G\/e Qpr* Val Gl ii T«pii Ti«^n 
Mid ucu uiy b oci vai uiu ucu ucu 




770 










nnc 
/ /3 


7 P n 


Pne 


Asp 


oiy 


Arg 


n*\ ii 


nld 


Leu 


Ti^n "Hi ft Pvfl Gin TViy Ala. Cvs Phe Asti 

ucu iiio Y will nil rtj-a ^.y d riit rto^ 1 


785 






/ y\j 




Toe flOO 


Val 


Val 


Leu 


Thr 


Asp 


He 


Asn 


Met Pro Asn Met Asn Gly Tyr Glu Leu 










805 






810 815 


Thr 


Ala 


Glu 


Leu 


Arg 


Arg 


Gin 


Gly Phe Arg Gin Pro lie lie Gly Ala 








820 






825 830 


Thr 


Ala 


Asn 


Ala 


Met 


Arg 


Glu 


Glu Arg Glu Arg Cys Met Ser Ala Gly 






835 










840 " 845 


Met 


Asn 


Asp 


Cys 


Leu 


Val 


Lys 


Pro Val Asp Leu Asn Ala Leu Gin Asn 




850 










855 


860 


Cys 


Leu 


He 


Asn 


He 


Leu 


Lys 


Val Asp Arg 



865 870 



fcf/v 
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Figure 6Q 

ORF1-6 

SEQ ID NO: 24 

Met Tyr Glu Ala Gin Leu Tyr Phe Glu Gin Arg Glu Ala Leu Leu Asn 

! 5 10 15 

His Leu Ser Gly Asn Val Val Pro Leu Ala Ala Gly Arg Ala Leu Val 

20 25 30 ' 

Asn Glu Ala Pro Asn Asn Val Ser lie Leu Pro Leu Ser Asp Gly Gly 

35 40 45 

Arg Gly Leu Leu Leu Thr Ala Arg Thr Leu Gly Asp Leu Arg Glu Lys 

50 55 60 

Arg Leu Ala Leu Met Tyr Leu Val Asp Thr Asp Lys Gly Pro Leu Val 
65 70 75 80 

Tyr Arg Leu Thr Ala Asp Gly Arg Pro Ser Ala Ala lie Ser Ser Thr 

85 90 95 

He Thr Lys Glu Val Tyr Arg Ala Leu Leu Ala Thr Pro Ser Ala Pro 

100 105 110 

Val His Trp Val Thr Asp Gly Gly Thr Pro Gin Arg Leu Tyr Leu Phe 

115 120 125 

Glu Ser Leu Gly Asp Glu Pro Gly Glu Gly Trp Leu Gly Leu Glu He 

130 135 140 

Leu Gly Glu Asp Leu Asp Ser Met Leu Arg Arg Asn Asp Ala Gly Asn 
145 150 155 160 

Tyr Met Leu Leu Asp Gin His Gly Gin Val Val Leu Ala Thr Asp Ala 

165 170 175 

Glu Ala Leu Gly Ser Gly Ala Ser Arg Thr Leu Leu Arg Gly Asp Gly 

180 185 190 

Phe Gly Phe He Gly Ala Gly Pro Leu Pro Gin His Met Val Leu Phe 

195 200 205 

Gin His Val Gly Ser Ser Ser Trp Asp Leu He Tyr His He Gly He 

210 . 215 220 

Gly Arg Leu Leu Leu Ala Leu Trp Leu Pro Leu Leu Leu Ala Ser Ala 
225 230 235 240 

Leu Ala Leu Ala Val Gly He Leu Leu His Trp Leu Val Arg Ser He 

245 250 255 

Glu Arg Arg Leu He Glu Pro Ala Lys Arg Arg Leu Glu Ala Leu Lys 

260 265 270 

Glu Ser Glu Ala Phe Ser Arg Ala Val He Gin Ala Ala Pro Val Ala 

275 280 285 

Leu Cys Val Leu Arg Arg Ala Asp Ala Ala Val Val Leu Glu Asn Pro 

290 295 300 

Gin Ala Arg Gin Trp Leu Gly Asp Ser Glu Ala He Ala His Asp Ala 
305 310 315 320 

Pro Arg Trp He Ser Gin Ala Phe Ala Gly Gly Val Lys Cys Ser Gly 

325 330 335 

Glu Glu Leu Glu Thr Glu Ala Gly Leu His Leu His" Leu Asn Tyr Thr 

340 345 350 

Pro Thr Arg Tyr Asn Gly Glu Asp Val Leu Phe Cys Ala Phe Ser Glu 

355 ~ 360 365 

He Ser Ala Arg Lys Arg Met Glu Ala Glu Leu Ala Arg Ala Lys Ser 

370 375 380 

Leu Ala Asp Ala Ala Asn Glu Ala Lys Thr Leu Phe Leu Ala Thr Met 
385 390 395 400 

Ser His Glu He Arg Thr Pro Leu Tyr Gly Met Leu Gly Thr Leu Glu 

405 410 415 

Leu Leu Gly Arg Thr Glu Leu Ser Arg Gin Gin Ala Gly Tyr Leu Lys 

420 425 430 

Ala He Gin His Ser Ser Ser Thr Leu Leu Gin Leu He Ser Asp Val 

435 440 445 

Leu Asp Val Ser Lys He Glu Ala Gly Gin Leu Asp Leu Glu Cys Val 
450 455 460 



Leo,/, 
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Glu Phe Ser 
465 

Gly Ala Ala 

Glu Leu Pro 

Leu Asn Asn 
515 

Val Asn Val 

530 
Leu Thr Trp 
545 

Gin Pro Arg 

Pro Val Ala 

Gin Leu Met 
595 

Ser Ser Phe 
610 

Glu Pro Gin 
625 

Arg Asp Leu 

Arg Ala Met 

Ser Leu Leu 
675- 

Ala Trp Pro 

690 
Pro Gin Ala 
705 

Gly Leu His 

Ser Thr Pro 

Val Leu Val 
755 

Gin Met Glu 
770 

Glu Ala Leu 
785 

Asp lie Asn. 

Arg Arg Gin 

Met Arg Glu 
835 

Leu Val Lys 

850 
lie Leu Lys 
865 



Pro Leu Glu Leu Thr Glu Glu Val Val Gin Ser Phe Thr 
470 475 480 

Gin Ala Lys Gly Leu Gin Leu Tyr Thr Cys Leu Ser Ala 

485 . 490 495 

Leu Arg Met Arg Gly Ala Ala Ala Ser lie Arg Gin He 
500 505 510 

Leu Leu Ser Asn Ala Val Lys ■ Phe Thr Asp Asn Gly Tyr 

520 525 
His Leu Lys Ala Ser Val Val Asp Ala Glu Cys Val Met 

535 540 
Gin Val Asn Asp Thr Gly Met Gly He Asn Val Glu Asp 
550 555 560 

Leu Phe Glu Pro Phe Tyr Gin He Arg Arg Ser Glu His 

565 570 575 

Gly Thr Gly Leu Gly Leu Ser He Ser Gin Arg Leu Ala 
580 585 590 

Asn Gly Ser Leu Lys Leu Val Ser Glu Leu Gly Leu Gly 

600 605 
Ser Leu Arg Leu Pro Leu Glu Arg He Ala Met Gin Ala 

615 620 
Asp Leu Ala Gly Cys Ala Val Gin Val Leu Ala Pro Val 
630 635 640 

Thr Glu Cys Leu Cys Gly Trp He Ser Arg Trp Gly Gly 

645 650 655 

Val Ala Thr Pro Arg Ser Leu Asp Glu Ala Asp Ala Thr 
660 665 670 

Val Glu Val Leu Leu Leu Glu Gly Ala Pro Met Phe Glu 

660 685 
Gly Cys Arg Val Glu Leu Ser Pro Gin Gly Asp Met Glu 

695 700 
Gin Gly Arg Asp Trp Leu Leu Gly Leu Asn Asn Leu Asp 
710 ~ 715 720 

Arg Ala Leu Gly Leu Ala His Gly Arg Leu Ala Asp Pro 

725 " 730 735 

Pro He Arg Leu Ala Pro Leu Arg Asn Leu Gly Leu Arg 
740 ~ 745 750 

Val Glu Asp Asn Ala He Asn Gin Leu He Leu Arg Asp 

760 765 
Ala Leu Gly Cys Ser Val Glu Leu Leu Phe Asp Gly Arg 

775 780 
Leu His Cys Gin Thr Ala Cys Phe Asp Val Val Leu Thr 
790 795 800 

Met Pro Asn Met Asn Gly Tyr Glu Leu Thr Ala Glu Leu 

805 810 815 

Gly Phe Arg Gin Pro He He Gly Ala Thr Ala Asn Ala 
820 " 825 830 

Glu Arg Glu Arg Cys Met Ser Ala Gly Met Asn Asp Cys 

.840 845 
Pro Val Asp Leu Asn Ala Leu Gin Asn Cys Leu He Asn 
855 860 

Val Asp ' Arg 
870 



OQ/v 
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Figure 6R 

0RF1-7 

SEQ ID NO: 25 

Met Tyr Leu Val Asp Thr Asp Lys Gly Pro Leu Val Tyr Arg Leu Thr 

! 5 10 15 

Ala Asp Gly Arg Pro Ser Ala Ala He Ser Ser Thr He Thr Lys Glu 

20 25 30 

Val Tyr Arg Ala Leu Leu Ala Thr Pro Ser Ala Pro Val His Trp Val 

35 40 45 

Thr Asp Gly Gly Thr Pro Gin Arg Leu Tyr Leu Phe Glu Ser Leu Gly 

50 55 60 

Asp Glu Pro Gly Glu Gly Trp Leu Gly Leu Glu He Leu Gly Glu Asp 
65 70 75 80 

Leu Asp Ser Met Leu Arg Arg Asn Asp Ala Gly Asn Tyr Met Leu Leu 

85 90 95 

Asp Gin His Gly Gin Val Val Leu Ala Thr Asp Ala Glu Ala Leu Gly 

100 " 105 HO 

Ser Gly Ala Ser Arg Thr Leu Leu Arg Gly Asp Gly Phe Gly Phe He 

115 120 125 

Gly Ala Gly Pro Leu Pro Gin His Met Val Leu Phe Gin His Val Gly 

130 135 140 

Ser Ser Ser Trp Asp Leu He Tyr His He Gly He Gly Arg Leu Leu 
145 150 155 160 

Leu Ala Leu Trp Leu Pro Leu Leu Leu Ala Ser Ala Leu Ala Leu Ala 

165 170 175 

Val Gly He Leu Leu His Trp Leu Val Arg Ser He Glu Arg Arg Leu 

180 185 190 

He Glu Pro Ala Lys Arg Arg Leu Glu Ala Leu Lys Glu Ser Glu Ala 

195 ~ 200 205 

Phe Ser Arg Ala Val He Gin Ala Ala Pro Val Ala Leu Cys Val Leu 

210 215 220 

Arg Arg Ala Asp Ala Ala Val Val Leu Glu Asn Pro Gin Ala Arg Gin 
225 230 235 240 

Trp Leu Gly Asp Ser Glu Ala He Ala His Asp Ala Pro Arg Trp He 

245 250 255 

Ser Gin Ala Phe Ala Gly Gly Val Lys Cys Ser Gly Glu Glu Leu Glu 

260 * 265 270 

Thr Glu Ala Gly Leu His Leu His Leu Asn Tyr Thr Pro Thr Arg Tyr 

275 280 285 

Asn Gly Glu Asp Val Leu Phe Cys Ala Phe Ser Glu He Ser Ala Arg 

290 295 300 

Lys Arg Met Glu Ala Glu Leu Ala Arg Ala Lys Ser Leu Ala Asp Ala 
305 310 315 320 

Ala Asn Glu Ala Lys Thr Leu Phe Leu Ala Thr Met Ser His Glu He 

325 330 335 

Arg Thr Pro Leu Tyr Gly Met Leu Gly Thr Leu Glu Leu Leu Gly Arg 

340 ~ " 345 350 

Thr Glu Leu Ser Arg Gin Gin Ala Gly Tyr Leu Lys Ala He Gin His 

355 360 365 

Ser Ser Ser Thr Leu Leu Gin Leu He Ser Asp Val Leu Asp Val Ser 

370 , 375 380 

Lys He Glu Ala Gly Gin Leu Asp Leu Glu Cys Val Glu Phe Ser Pro 
385 390 * 395 400 

Leu Glu Leu Thr -Glu Glu Val Val Gin Ser Phe Thr Gly Ala Ala Gin 

405 410 415' 

Ala Lys Gly Leu Gin Leu Tyr Thr Cys Leu Ser Ala Glu Leu Pro Leu 

420 425 430 

Arg Met Arg Gly Ala Ala Ala Ser He Arg Gin He .Leu Asn Asn Leu 

435 440 445 

Leu Ser Asn Ala Val Lys Phe Thr Asp Asn Gly Tyr Val Asn Val His 
450 * 455 ' 460 
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Leu 


Lys 


Ala 


Ser 


Val 


Val 


Asp Ala 


Glu Cys Val Met Leu Thr Trp Gin 


465 








470 




475 480 


Val 


Asn 


Asp 


Thr 


Gly 


Met 


Gly He 


Asn Val Glu Asp Gin Pro Arg Leu 






485 






490 495 


Phe 


Glu 


Pro 


Phe 


Tyr. 


Gin 


He Arg 


Arg Ser Glu His Pro Val Ala Gly 






500 








505 510 


Thr 


Gly 


Leu 


Gly 


Leu 


Ser 


He Ser 


Gin Arg Leu Ala Gin Leu Met Asn 


515 








520 


525 


Gly 


Ser 


Leu 


Lys 


Leu 


Val 


Ser Glu 


Leu Gly Leu Gly Ser Ser Phe Ser 


530 










535 


540 


Leu 


Arg 


Leu 


Pro 


Leu 


Glu 


Arg He 


Ala Met Gin Ala Glu Pro Gin Asp 


545 








550 




555 560 


Leu 


Ala 


Gly 


Cys 


Ala 


Val 


Gin Val 


Leu Ala Pro Val Arg Asp Leu Thr 




565 






570 575 


Glu 


Cys 


Leu 


Cys 


Gly 


Trp 


He Ser 


Arg Trp Gly Gly Arg Ala Met Val 






580 








585 590 


Ala 


Thr 


Pro 


Arg 


Ser 


Leu 


Asp Glu 


Ala Asp Ala Thr Ser Leu Leu Val 




595 






600 


605 


Glu 


Val 


.Leu 


Leu 


Leu 


Glu 


Gly Ala 


Pro Met Phe Glu Ala Trp Pro Gly 


610 










615 


620 




ArQ 


Val 


Glu 


Leu 


Ser 


Pro Gin 


Gly Asp Met Glu Pro Gin Ala Gin 


625 








630 




635 640 


Glv 


Arg 


Asp 


Trp 


Leu 


Leu 


Gly Leu 


Asn Asn Leu Asp Gly Leu His Arg 






645 






650 655 


Ala 


Leu 


Gly 


Leu 


Ala 


His 


Gly Arg 


Leu Ala Asp Pro Ser Thr Pro Pro 






660 








665 670 


lie 


Arci 


Leu 


Ala 


Pro 


Leu 


Arg Asn 


Leu Gly Leu Arg Val Leu Val Val 




675 








680 


685 


Glu 


Asp 


Asn 


Ala 


lie 


Asn 


Gin Leu 


He Leu Arg Asp Gin Met Glu Ala 




690 










695 


700 


Leu 


Gly 


Cys 


Ser 


Val 


Glu 


Leu Leu 


Phe Asp Gly Arg Glu Ala Leu Leu 


7 05 






710 




715 720 


His 


Cys 


Gin 


Thr 


Ala 


Cys 


Phe Asp 


Val Val Leu Thr Asp He Asn Met 








725 






730 735 


Pro 


Asn 


Met 


Asn 


Gly 


Tyr 


Glu Leu 


Thr Ala Glu Leu Arg Arg Gin Gly 








740 








745 750 


Phe 


Arg 


Gin 


Pro 


He 


He 


Gly Ala 


Thr Ala Asn Ala Met Arg Glu Glu 




755 








760 


765 


Arg 


Glu 


Arg 


Cys 


Met 


Ser 


Ala Gly 


Met Asn Asp Cys Leu Val Lys Pro 


770 










775 


780 


Val 


Asp 


Leu 


Asn 


Ala 


Leu 


Gin Asn 


Cys Leu He Asn He Leu Lys Val 


78.5 








790 




795 800 



Asp Arg 
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Figure 6S 

ORF1-8 

SEQ ID NO; 26 

Met Leu Arg Arg Asn Asp Ala Gly Asn Tyr Met Leu Leu Asp Gin His 

1 5 10 15 

Gly Gin Val Val Leu Ala Thr Asp Ala Glu Ala Leu Gly Ser Gly Ala 

20 25 30 

Ser Arg Thr Leu Leu Arg Gly Asp Gly Phe Gly Phe He Gly Ala Gly 

35 40 . 45 

Pro Leu Pro Gin His Met Val Leu Phe Gin His Val Gly Ser Ser Ser 

50 55 60 

Trp Asp Leu He Tyr His He Gly He Gly Arg Leu Leu Leu Ala Leu 
65 70 75 80 

Trp Leu Pro Leu Leu Leu Ala Ser Ala Leu Ala Leu Ala Val Gly He 

85 90 95 

Leu Leu His Trp Leu Val Arg Ser He Glu Arg Arg Leu He Glu Pro 

100 . • 105 110 

Ala Lys Arg Arg Leu Glu Ala Leu Lys Glu Ser Glu Ala Phe Ser Arg 

115 120 . . 125 

Ala Val He Gin Ala Ala Pro Val Ala Leu Cys Val Leu Arg Arg Ala 

130 135 140 

Asp Ala Ala Val Val Leu Glu Asn Pro Gin Ala Arg Gin Trp Leu Gly 
145 150 155 160 

Asp Ser Glu Ala -He Ala His Asp Ala Pro Arg Trp He Ser Gin Ala 

165 170 175 

Phe Ala Gly Gly Val Lys Cys Ser Gly Glu Glu Leu Glu Thr Glu Ala 

180 185 190 

Gly Leu His Leu His Leu Asn Tyr Thr Pro Thr Arg Tyr Asn Gly Glu 

195 200 205 

Asp Val Leu Phe Cys Ala Phe Ser Glu He Ser Ala Arg Lys Arg Met 

210 215 220 

Glu Ala Glu Leu Ala Arg Ala Lys Ser Leu Ala Asp Ala Ala Asn Glu 
225 230 235 240 

Ala Lys Thr Leu Phe Leu Ala Thr Met Ser His Glu He Arg Thr Pro 

245 250 255 

Leu Tyr Gly Met Leu Gly Thr Leu Glu Leu Leu Gly Arg Thr Glu Leu 

260 ' 265 270 

Ser Arg Gin Gin Ala Gly Tyr Leu Lys Ala He Gin His Ser Ser Ser 

275 280 .285 

Thr Leu Leu Gin Leu He Ser Asp Val Leu Asp Val Ser Lys lie Glu 

290 295 300 

Ala Gly Gin Leu Asp Leu Glu Cys Val Glu Phe Ser Pro Leu Glu Leu 
305 310 315 320 

Thr Glu Glu Val Val Gin Ser Phe Thr Gly Ala Ala Gin Ala Lys Gly 

325 330 335 

Leu Gin Leu Tyr Thr Cys Leu Ser Ala Glu Leu Pro* Leu Arg Met Arg 

340 345 350 

Gly Ala Ala Ala Ser lie Arg Gin lie Leu Asn Asn Leu Leu Ser Asn 

355 • 360 365 

Ala Val Lys Phe Thr Asp Asn Gly Tyr Val Asn Val His Leu Lys Ala 

370 375 380 

Ser Val Val Asp Ala Glu Cys Val Met Leu Thr Trp Gin Val Asn Asp 
385 390 395 .400 

Thr Gly Met Gly He Asn Val Glu Asp Gin Pro Arg Leu Phe Glu Pro. 

405 410 415 

Phe Tyr Gin lie Arg Arg Ser Glu His Pro Val Ala Gly Thr Gly Leu 

420 425 430 

Gly Leu Ser lie Ser Gin Arg Leu Ala Gin Leu Met Asn Gly Ser Leu 

435 440 445 

Lys Leu Val Ser Glu Leu Gly Leu Gly Ser Ser Phe Ser Leu Arg Leu 
450 455 460 
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Pro Leu 
465 

Cys Ala 

Cys Gly 

Arg Ser 

Leu Leu 
530 
Glu Leu 
545 

Trp Leu 

Leu Ala 

Ala Pro 

Ala lie 
610 
Ser Val 
625 

Thr Ala 

Asn- Gly 

Pro He 

Cys Met 
630 
Asn Ala 
705 



Glu Arg He Ala Met Gin Ala Glu Pro Gin Asp Leu Ala <5ly 
470 475 480 

Val Gin Val Leu Ala Pro Val Arg Asp Leu Thr Glu Cys Leu 

485 490 495 

Trp He Ser Arg Trp Gly Gly Arg Ala Met Val Ala Thr Pro 

500 ~ " 505 510 

Leu Asp Glu Ala Asp Ala Thr Ser Leu Leu Val Glu Val Leu 
515 520 525 

Glu Gly Ala Pro Met Phe Glu Ala Trp Pro Gly Cys Arg Val 

535 540 
Ser Pro Gin Gly Asp Met Glu Pro Gin Ala Gin Gly Arg Asp 
550 555 560 

Leu Gly Leu Asn Asn Leu Asp Gly Leu His Arg Ala Leu Gly 

565 570 575 

His Gly Arg Leu Ala Asp Pro Ser Thr Pro Pro He Arg Leu 

580 585 590 

Leu Arg Asn Leu Gly Leu Arg Val Leu Val Val Glu Asp Asn 
595 600 605 

Asn Gin Leu He Leu Arg Asp Gin Met Glu Ala Leu Gly Cys 

615 620 
Glu Leu Leu Phe Asp Gly Arg Glu Ala Leu Leu His Cys Gin 
630 635 640 

Cys Phe Asp Val Val Leu Thr Asp lie Asn Met Pro Asn Met 

645 650 . .655 

Tyr Glu Leu Thr Ala Glu Leu Arg Arg Gin Gly Phe Arg Gin 

660 665 670 

He Gly Ala Thr Ala Asn Ala Met Arg Glu Glu Arg Glu Arg 
675 680 685 

Ser Ala Gly Met Asn Asp Cys Leu Val Lys Pro Val Asp Leu 

695 700 
Leu Gin Asn Cys Leu He Asn He Leu Lys Val Asp Arg 
710 .715 
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Figure 6T 
ORF1-9 



Met 


Leu 


Leu 


Asp 


Gin 


His 


Gly 


Gin Val Val Leu Ala Thr Asp Ala Glu 


1 






5 






10 15 


Ala 


Leu 


Gly 


Ser 


Gly 


Ala 


Ser 


Arg Thr Leu Leu Arg Gly Asp Gly Phe 






20 






25 30 


Gly 


Phe 


He 


Gly 


Ala 


Gly 


Pro 


Leu Pro Gin His Met Val Leu Phe Gin 




35 








40 45 


His 


Val 


Gly 


Ser 


Ser 


Ser 


Trp 


Asp Leu He Tyr His He Gly He Gly 




50 








55 


60 


Arg 


Leu 


Leu 


Leu 


Ala 


Leu 


Trp 


Leu Pro Leu Leu Leu Ala Ser Ala Leu 


65 










70 




75 80 


Ala 


Leu 


Ala 


Val 


Gly 


He 


Leu 


Leu His Trp Leu Val Arg Ser He Glu 










85 






90 95 


Arcj 


Arq 
**** 3 


Leu 


He 


Glu 


Pro 


Ala 


Lys Arg Arg Leu Glu Ala Leu Lys Glu 




100 








105 ~ HO 


Ser 


Glu 


Ala 


Phe 


Ser 


Arg 


Ala 


Val He Gin Ala Ala Pro Val Ala Leu 






.115 










120 125 


Cvs 


Val 


Leu 


Arq 


Arg 


Ala 


Asp 


Ala Ala Val Val Leu Glu Asn Pro Gin 


130 










135 


140 


Ala 


ArQ 


Gin 


Trp 


Leu 


Gly 


Asp 


Ser Glu Ala He Ala His Asp Ala Pro 


145 








150 




155 160 


At* ci 




He 


Ser 


Gin 


Ala 


Phe 


Ala Gly Gly Val Lys Cys Ser Gly Glu 






165 






170 175 


Glu 


Leu 


Glu 


Thr 


Glu 


Ala 


Gly 


Leu His Leu His Leu Asn Tyr Thr Pro 








180 








185 190 


Thr 


Ara 


Tvr 


Asn 


Glv 


Glu 


Asp 


Val Leu Phe Cys Ala Phe Ser Glu He 




195 










200 205 


Ser 


Ala 


Arq 


Lvs 


Arg 


Met 


Glu 


Ala Glu Leu Ala Arg Ala Lys Ser Leu 




210 










215 


220 


Ala 


Asd 


Ala 


Ala 


Asn 


Glu 


Ala 


Lys Thr Leu Phe Leu Ala Thr Met Ser 


225 








230 




235 240 


His 


Glu 


He 


Ara 


Thr 


Pro 


Leu 


Tyr Gly Met Leu Gly Thr Leu Glu Leu 








245 






250 255 


Leu 


Glv 


Arq 


Thr 


Glu 


Leu 


Ser 


Arg Gin Gin Ala Gly Tyr Leu Lys Ala 






260 








265 270 


He 


Gin 


His 


Ser 


Ser 


Ser 


Thr 


Leu Leu Gin Leu He Ser Asp Val Leu 






275 










280 285 


Asp 


Val 


Ser 


Lys 


He 


Glu 


Ala 


Gly Gin Leu Asp Leu Glu Cys Val Glu 


290 










295 


3 00 


Phe 


Ser 


Pro 


Leu' 


Glu 


Leu 


Thr 


Glu Glu Val Val Gin Ser Phe Thr Gly 


305 










310 




315 320 


Ala 


Ala 


Gin 


Ala 


Lvs 


Gly 


Leu 


Gin Leu Tyr Thr Cys Leu Ser Ala Glu 










325 






330 335 


Leu 


Pro 


Leu 


Arq 


Met 


Arq 


Gly 


Ala Ala Ala Ser He Arg Gin He Leu 








340 








345 350 




Asn 


Leu 


Leu 


Ser 


Asn 


Ala 


Val Lys Phe Thr Asp Asn Gly Tyr Val 






355 










360 365 


Ash 


Val 


His 


Leu 


Lvs 


Ala 


Ser 


Val Val Asp Ala Glu Cys Val Met Leu 




370 










375 


380 


Thr 


Trp 


Gin 


Val 


Asn 


Asp 


Thr 


Gly Met Gly He Asn Val Glu Asp Gin 


385 








390 




395 400 


Pro 


Arg 


Leu 


Phe 


Glu 


Pro 


Phe 


Tyr Gin He Arg Arg Ser Glu His Pro 








405 






410 415 . • 


Val 


Ala 


Gly 


Thr 


Gly 


Leu 


Gly 


Leu Ser lie Ser Gin Arg Leu Ala Gin 








420 








425 430 


Leu 


Met 


Asn 


Gly 


Ser 


Leu 


Lys 


Leu Val Ser Glu Leu Gly Leu Gly Ser 






435 










440 445 


Ser 


Phe 


Ser 


Leu 


Arg 


Leu 


Pro 


Leu Glu Arg He Ala Met Gin Ala Glu 




450 










455 


460 



1*17 > 
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Pro 


Gin 


Asn 


Leu 


Ala 


Gly 


Cys 


Ala 


Val Gin 


Val Leu Ala Pro Val Arg 


a a ^ 








470 








475 480 


Asp 


uc w 


Thr 


Glu 


Cys 


Leu 


Cvs 


Gly 


Trp lie 


Ser Arg Trp Gly Gly Arg 








485 








490 


495 


nXa 


Mpt- 


Val 


Ala 
500 


Thr 


Pro 


Arc? 


Ser 


Leu Asp 
505 


Glu Ala Asp Ala Thr Ser 
510 


Leu 


T.on 


V Cll 


Glu 


Val 


Leu 


Leu 


Leu 


Glu Gly 


Ala Pro Met Phe Glu Ala 




m 5 










520 




525 


Trp 


Pro 




Cys 


Arg 


Val 


Glu 


Leu 


Ser Pro 


Gin Gly Asp Met Glu Pro 


3 O U 










J J J 






540 


Gin 


Ala 


Gin 


Gly 


Arg 


Asp 


Trp 


Leu 


Leu Gly 


Leu Asn Asn Leu Asp Gly 


545 










J JU 








555 560 
Arg Leu Ala Asp Pro Ser 


Leu 


HIS 




r\± d 


Leu 


fil V 
V3J. jr 


Leu 


Ala 


His Gly 








C £ c 

JO J 








570 


575 


Thr 


Pro 


pro 


lie 


Arg 


Leu 


Ala 


Pro 


Leu Arg 


Asn Leu Gly Leu Arg Val 






c d r» 








5R5 


590 


Leu 


vax 


Vol 


Ul u 




Asn 


Ala 


lie 


Asn Gin 


Leu He Leu Arg Asp Gin 






c o c 








vUv 




605 


Met 


b±U 

610 








v-ys 


Cor 


Val 


Glu lieu 


Leu Phe Asp Gly Arg Glu 
620 


Ala 


Leu 


Leu 


His 


Cys 


Gin 


Thr 


Ala 


Cys Phe 


Asp Val Val Leu Thr Asp 


625 








fi^ 0 

DOv 








635 640 


Tie* 


Asn 


Met 


Pro 


Asn 
645 


Met 


Asn 


Gly 


Tyr Glu 
650 


Leu Thr Ala Glu Leu Arg 
655 


Arg 


Gin 


Gly 


Phe 


Arg 


Gin 


Pro 


He 


He Gly 


Ala Thr Ala Asn Ala Met 






660 










665 


670' 


Arg 


Glu 


Glu 


Arg 


Glu 


Arg 


Cys 


Met 


Ser Ala 


Gly Met Asn Asp Cys Leu 




675 










680 




685 


Val 


Lys 
690 


Pro 


Val 


Asp 


Leu 


Asn 
695 


Ala 


Leu Gin 


Asn Cys Leu He Asn He 
700 


Leu 


Lys 


Val 


Asp 


Arg 













705 / 



i/r/v 
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Figure 6U 

ORF1-10 

SEQ ID NO:28 

Met Val Leu Phe Gin His Val Gly Ser Ser Ser Trp Asp Leu lie Tyr 

! 5 10 15 

His He Gly He Gly Arg Leu Leu Leu Ala Leu Trp Leu Pro Leu Leu 

20 25 . 30 

Leu Ala Ser Ala Leu Ala Leu Ala Val Gly He Leu Leu His Trp Leu 

35 40 45 

Val Arg Ser He Glu Arg Arg Leu lie Glu Pro Ala Lys Arg Arg Leu 

50 55 60 

Glu Ala Leu Lys Glu Ser Glu Ala Phe Ser Arg Ala Val He Gin Ala 
65 ' 70 75 80 

Ala Pro Val Ala Leu Cys Val Leu Arg Arg Ala Asp Ala Ala Val Val 

85 90 . 95 

Leu Glu Asn Pro Gin Ala Arg Gin Trp Leu Gly Asp Ser Glu Ala He 

100 105 HO 

Ala His Asp Ala Pro Arg Trp He Ser Gin Ala Phe Ala Gly Gly Val 

115 120 125 

Lys Cys Ser Gly Glu Glu Leu Glu Thr Glu Ala Gly Leu His Leu His 

130 135 140 

' Leu Asn Tyr Thr Pro Thr Arg Tyr Asn Gly Glu Asp Val Leu Phe Cys 
145 150 155 160 

Ala Phe Ser Glu He Ser Ala Arg Lys Arg Met Glu Ala Glu Leu Ala 

165 170 175 

Arg Ala Lys Ser Leu Ala Asp Ala Ala Asn Glu Ala Lys Thr Leu Phe 

180 185 190 

Leu Ala Thr Met Ser His Glu He Arg Thr Pro Leu Tyr Gly Met Leu 

195 200 205 

Gly Thf Leu Glu Leu Leu Gly Arg Thr Glu Leu Ser Arg Gin Gin Ala 

210 215 220 

Gly Tyr Leu Lys Ala He Gin His Ser Ser Ser Thr Leu Leu Gin Leu 
225 230 235' 240 

He Ser Asp Val Leu Asp Val Ser Lys He Glu Ala Gly Gin Leu Asp 

245 250 255 

Leu Glu Cys Val Glu Phe Ser Pro Leu Glu Leu Thr Glu Glu Val Val 

260 265 270 

Gin Ser Phe Thr Gly Ala Ala Gin Ala Lys Gly Leu Gin Leu Tyr Thr 

275 280 285 

Cys Leu Ser Ala Glu Leu Pro Leu Arg Met Arg Gly Ala Ala Ala Ser 

290 295 300 

He Arg Gin lie Leu Asn Asn Leu Leu Ser Asn Ala Val Lys Phe Thr 
305 310 315 320 

Asp Asn Gly Tyr Val Asn Val His Leu Lys Ala Ser Val Val Asp Ala 

325 330 335 

Glu Cys Val Met Leu Thr Trp Gin Val Asn Asp Thr Gly Met Gly lie 

340 345 350 

Asn Val Glu Asp Gin Pro Arg Leu Phe Glu Pro Phe "Tyr Gin lie Arg 

355 360 365 

Arg Ser Glu His Pro Val Ala Gly Thr Gly Leu Gly Leu Ser He Ser 

370 375 380 

Gin Arg Leu Ala Gin Leu Met Asn Gly Ser Leu Lys Leu Val Ser Glu 
385 390 395 400 

Leu Gly Leu Gly Ser Ser Phe Ser Leu Arg Leu Pro Leu Glu Arg lie 

405 410 415 

Ala Met Gin Ala Glu Pro Gin Asp Leu Ala Gly Cys Ala Val Gin Val 

420 425 430 

Leu Ala Pro Val Arg Asp Leu Thr Glu Cys Leu Cys Gly Trp He Ser 
435 440 445 

Arg Trp Gly Gly Arg Ala Met Val Ala Thr Pro Arg Ser Leu Asp Glu 
450 455 460 
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Met 
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645 


Cys 


Leu 


lie 
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lie 



660 



Leu Leu Val Glu Val Leu Leu Leu Glu Gly Ala 
470 475 480 

Trp Pro Gly CyB Arg Val Glu Leu Ser Pro Gin 

490 495 
Gin Ala Gin Gly Arg Asp Trp Leu Leu Gly Leu 

505 510 
Leu His Arg Ala Leu Gly Leu Ala His Gly Arg 

520 525 
Thr Pro Pro He Arg Leu Ala Pro Leu Arg Asn 

535 540 
Leu Val Val Glu Asp Asn Ala He Asn Gin Leu 
550 555 560 

Met Glu Ala Leu Gly Cys Ser Val Glu Leu Leu 

570 575 
Ala Leu Leu His Cys Gin Thr Ala Cys Phe Asp 

585 590 
He Asn Met Pro Asn Met Asn Gly Tyr Glu Leu 

600 605 
Arg Gin Gly Phe Arg Gin Pro He He Gly Ala 

615 620 
Arg Glu Glu Arg Glu Arg Cys Met Ser Ala Gly 
630 ~ 635 640 

Val Lys Pro Val Asp Leu Asn Ala Leu Gin Asn 

650 655 
Leu Lys Val Asp Arg 
665 
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Figure . 6V 

ORF1-11 

SEQ ID NO:29 

Met Glu Ala Glu Leu Ala Arg Ala Lys Ser Leu Ala Asp Ala Ala Asn 

1 5 10 15 

Glu Ala Lys Thr Leu Phe Leu Ala Thr Met Ser His Glu He Arg Thr 

20 25 30 

Pro Leu Tyr Gly Met Leu Gly Thr Leu Glu Leu Leu Gly Arg Thr Glu 

35 40 45 

Leu Ser Arg Gin Gin Ala Gly Tyr Leu Lys Ala He Gin His Ser Ser 

50 55 60 

Ser Thr Leu Leu Gin Leu He Ser Asp Val Leu Asp Val Ser Lys He ■ 
65 70 75 80 

Glu Ala Gly Gin Leu Asp Leu Glu Cys Val Glu Phe Ser Pro Leu Glu 

85 90 95 

Leu Thr Glu Glu Val Val Gin Ser Phe Thr Gly Ala Ala Gin Ala Lys 

100 105 HO 

Gly Leu Gin Leu Tyr Thr Cys Leu Ser Ala Glu Leu Pro Leu Arg Met 

115 120 125 

Arg Gly Ala Ala Ala Ser He Arg Gin He Leu Asn Asn Leu Leu Ser 

130 135 140 

Asn Ala Val Lys Phe Thr Asp Asn Gly Tyr Val Asn Val His Leu Lys 
145 150 155 160 

Ala Ser Val Val Asp Ala Glu Cys Val Met Leu Thr Trp Gin Val Asn 

165 170 175 

Asp Thr Gly Met Gly He Asn Val Glu Asp Gin Pro Arg Leu Phe Glu 

180 185 190 

Pro Phe Tyr Gin He Arg Arg Ser Glu His Pro Val Ala Gly Thr Gly 

195 '200 205 

Leu Gly Leu Ser He Ser Gin Arg Leu Ala Gin Leu Met Asn Gly Ser 

210 215 220 

Leu Lys Leu Val Ser Glu Leu Gly Leu Gly Ser Ser Phe Ser Leu Arg 
225 230 235 240 

Leu Pro Leu Glu Arg He Ala Met Gin Ala Glu Pro Gin Asp Leu Ala 

245 250 255 

Gly Cys Ala Val Gin Val Leu Ala Pro Val Arg Asp Leu Thr Glu Cys 

260 265 270 

Leu Cys Gly Trp He Ser Arg Trp Gly Gly Arg Ala Met Val Ala Thr 

275 280 285 

Pro Arg Ser Leu Asp Glu Ala Asp Ala Thr Ser Leu Leu Val Glu Val 

290 ~ 295 300 

Leu Leu Leu Glu Gly Ala Pro Met Phe Glu Ala Trp Pro Gly Cys Arg 
305 310 315 320 

Val Glu Leu Ser Pro Gin Gly Asp Met Glu Pro Gin Ala Gin Gly Arg 

325 330 335 

Asp Trp Leu Leu Gly Leu Asn Asn Leu Asp Gly Leu His Arg Ala Leu 

340 345 350 

Gly Leu Ala His Gly Arg Leu Ala Asp Pro Ser Thr Pro Pro He Arg 

355 360 -365 

Leu Ala Pro Leu Arg Asn Leu Gly Leu Arg Val Leu Val Val Glu Asp 

370 ~ 375 380 

Asn Ala He Asn Gin Leu He Leu Arg Asp Gin Met Glu Ala Leu Gly 
385 390 395 400 

Cys Ser Val Glu Leu Leu Phe Asp Gly Arg Glu Ala Leu Leu His Cys 

405 410 415 

Gin Thr Ala Cys Phe Asp Val Val Leu Thr Asp He Asn Met Pro Asn 

420 425 430 

Met Asn Gly Tyr Glu Leu Thr Ala Glu Leu Arg Arg Gin Gly Phe Arg 

435 440 445 

Gin Pro He He Gly Ala Thr Ala Asn Ala Met Arg Glu Glu Arg Glu 
450 455 460 

k\)/\ 
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Ara Cvs Met Ser Ala Gly Met Asn Asp Cys Leu Val Lys Pro Val Asp 
46 5 470 475 480 

Leu Asn Ala Leu Gin Asn Cys Leu lie Asn He Leu Lys Val Asp Arg 
485 490 495 
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ORF3-2 

SEQ ID NO:30 

atggatgtta tacgggagca tgaggtattt 
accacccaga agtacgacta tgacgtggtg 
aacggattag cggtctatga gggacgggag 
accaagcacg cgttgagcgc cgattcctcg 
gccaatttct acggaagctt ctggagtgtt 
tttgatcttt ccggcagcac ccgcctggca 
gacaggttga gcggaagcta tccgatgata 
cggccggtgg gggaggacgc tcagcgtgtc 
tcggcgctgg agatgttggg agtcgcccgg 
gacgagccga accatctgat catcgctgcg 
ttcgaacagt tggttgagcg cccggcattc 
gaggtattgc tcggcgcggc ccctgcgacc 
cagggggtcg ccgttcaact gcgcagccag 
accgactacg gcaatttctt tcgccactcc 
ccggcgctgc tcctggccgg ttggctcggg 
ccggtgcatc gggcgcaccg gcaactggtg 
cagaccgcgc cggtggctct ggtggtgctg 
aaccacttgg ccgcccagtg gctgggcggg 
tggaagcttt tcgatgcgcg tgggcaggta 
cgctatttgc agaccgcctt cgcggcgacc 
gtattcaacg acatcacggt ccactgcgag 
gcagcggatg ccgccagcca ggccaagacc 
cgtactcccc tgtacggtgt ccttggcacc 
gagcggcaac gcgcctacct acgcaccatc 
attagcgatg tgctggatgt ctcgaagatc 
gccttcaatc cgctggacct agtgcgggaa 
gccaaggacc tgcaggtaga cccgctcgat 
ggcttcgaag aaagcgttct gttcgaggtt 
ggtgtcgtcg gcgttgtcga acaacgcctg 
gtcgcgcacc tgcacgagga tgaccggcag 
ggaagcgacc ctggtcaggt gcaccacatt 
accctcgcgg ccgcgcatgg aatggcaaaa 
cgtaacgtca atttccaggc gtcaaaaaca 
aaatctagat ag 
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Figure 7a 



cttgggcgca tcgctcgaaa aagcgacaag 60 
cctttgcagc ggcacttgtt ggcaaaggaa 120 
ttttcctttg ctatgccatt tctactggct 180 
ggagatccgt tttcgctcgg tgtattgctc 240 
tccgcctatc ccgcgccaca gttactgatc 300 
gtgccgtcga ttccctccac agcgcagcgt 360 
gtcgagcgca ttctggcgcg cttgcgcacc 420 
cattggatac gcgctgatcg ctatcgcgac 480 
gttgatctgc cggaaacact ctggtggcac 540 
agcctgcttg atctcaggcg aatcaatgac 600 
gattcgtaca gcctggtatc gccggatggc 660 
ggcctgaggg atggcctgaa cctcacccga 720 
cctgagaacg gctggctcgc ggtctaccga 780 
cggtggctgg tggcaggtct gctgctgacc 840 
atgcgttggt acaccagcag cgtcgtcaac 900 
gagagcgaca ccttcagccg gacgctgata 960 
acccaggatg accagcaact ggtgacctgc 1020 
cccacggaga tccttgggct gacttccaac 1080 
ccaggagaca tctgtatcca ggtcggtggg 1140 
cgctatgccg gcaccgaggc ggtactgtgc 1200 
gcggagaccg cgctgtccaa tgcgaagcga 1260 
ctgttcctgg cccgcatgag ccatgaaatc 1320 
ctggagttgc tcgacctgac caccctgaac 13 80 
cagagttcgt ctgcgacgct catgcaactg 1440 
gaagcggggc agatggctct gaccctggcc 1500 
gtgcttggca actttgccgc cagcgccatg 1560 
actcttgcgc ttgaggcgca ggtcgcgcat 1620 
gctggtggct cggtcggcca tttcgaagag 1680 
caacgcctgt ttcagctgca gcgccgcctt 1740 
gcgccccgct ccggcgttcg gcgacggctc 1800 
ggcatcgttc tgcatcggga ctctcctgcc 1860 
atcgggcaca gaggatcgat tggcgtcgtc 1920 
agtatctaca ttcattatag agatactttc 1980 

1992 
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ORF3-3 

SEQ ID NO:31 

atgccatttc tactggctac caagcacgcg 
tcgctcggtg tattgctcgc caatttctac 
gcgccacagt tactgatctt tgatctttcc 
ccctccacag cgcagcgtga caggttgagc 
ctggcgcgct tgcgcacccg gccggtgggg 
gctgatcgct atcgcgactc ggcgctggag 
gaaacactct ggtggcacga cgagccgaac 
ctcaggcgaa tcaatgactt cgaacagttg 
ctggtatcgc cggatggcga ggtattgctc 
ggcctgaacc tcacccgaca gggggtcgcc 
tggctcgcgg tctaccgaac cgactacggc 
gcaggtctgc tgctgacccc ggcgctgctc 
accagcagcg tcgtcaaccc ggtgcatcgg 
ttcagccgga cgctgataca gaccgcgccg 
cagcaactgg tgacctgcaa ccacttggcc 
cttgggctga cttccaactg gaagcttttc 
tgtatccagg tcggtgggcg ctatttgcag 
accgaggcgg tactgtgcgt attcaacgac 
ctgtccaatg cgaagcgagc agcggatgcc 
cgcatgagcc atgaaatccg tactcccctg 
gacctgacca ccctgaacga gcggcaacgc 
gcgacgctca tgcaactgat tagcgatgtg 
atggctctga ccctggccgc cttcaatccg 
tttgccgcca gcgccatggc caaggacctg 
gaggcgcagg tcgcgcatgg cttcgaagaa 
gtcggccatt tcgaagaggg tgtcgtcggc 
cagctgcagc gccgccttgt cgcgcacctg 
ggcgttcggc gacggctcgg aagcgaccct 
catcgggact ctcctgccac cctcgcggcc 
ggatcgattg gcgtcgtccg taacgtcaat 
cattatagag atactttcaa atctagatag 
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Figure 7B 



ttgagcgccg attcctcggg agatccgttt 60 
ggaagcttct ggagtgtttc cgcctatccc 120 
ggcagcaccc gcctggcagt gccgtcgatt 180 
ggaagctatc cgatgatagt cgagcgcatt 240 
gaggacgctc agcgtgtcca ttggatacgc 300 
atgttgggag tcgcccgggt tgatctgccg 360 
catctgatca tcgctgcgag cctgcttgat 420 
gttgagcgcc cggcattcga ttcgtacagc 480 
ggcgcggccc ctgcgaccgg cctgagggat 540 
gttcaactgc gcagccagcc tgagaacggc 600 
aatttctttc gccactcccg gtggctggtg 660 
ctggccggtt ggctcgggat gcgttggtac 720 
gcgcaccggc aactggtgga gagcgacacc 780 
gtggctctgg tggtgctgac ccaggatgac 840 
gcccagtggc tgggcgggcc cacggagatc 900 
gatgcgcgtg ggcaggtacc aggagacatc 960 
accgccttcg cggcgacccg ctatgccggc 1020 
atcacggtcc actgcgaggc ggagaccgcg 1080 
gccagccagg ccaagaccct gttcctggcc 1140 
tacggtgtcc ttggcaccct ggagttgctc 1200 
gcctacctac gcaccatcca gagttcgtct 1260 
ctggatgtct cgaagatcga agcggggcag 1320 
ctggacctag tgcgggaagt gcttggcaac 1380 
caggtagacc cgctcgatac tcttgcgctt 1440 
agcgttctgt tcgaggttgc tggtggctcg 1500 
gttgtcgaac aacgcctgca acgcctgttt 1560 
cacgaggatg accggcaggc gccccgctcc 1620 
ggtcaggtgc accacattgg catcgttctg 1680 
gcgcatggaa tggcaaaaat cgggcacaga 1740 
ttccaggcgt caaaaacaag tatctacatt 1800 

1830 
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Figure 7C 

ORF3-4 

SEQ ID NO: 32 

atgatagtcg agcgcattct ggcgcgcttg cgcacccggc cggtggggga ggacgctcag 60 
cgtgtccatt ggatacgcgc tgatcgctat cgcgactcgg cgctggagat gttgggagtc 120 
qcccgggttg atctgccgga aacactctgg tggcacgacg agccgaacca tctgatcatc 180 
gctgcgagcc tgcttgatct caggcgaatc aatgacttcg aacagttggt tgagcgcccg 240 
gcattcgatt cgtacagcct ggtatcgccg gatggcgagg tattgctcgg cgcggcccct 300 
gcgaccggcc tgagggatgg cctgaacctc acccgacagg gggtcgccgt tcaactgcgc 360 
agccagcctg agaacggctg gctcgcggtc taccgaaccg actacggcaa tttctttcgc 420 
cactcccggt ggctggtggc aggtctgctg ctgaccccgg cgctgctcct ggccggttgg 480 
ctcgggatgc gttggtacac cagcagcgtc gtcaacccgg tgcatcgggc gcaccggcaa 540 
ctggtggaga gcgacacctt cagccggacg ctgatacaga ccgcgccggt ggctctggtg 600 
gtgctgaccc aggatgacca gcaactggtg acctgcaacc acttggccgc ccagtggctg 660 
agcgggccca cggagatcct tgggctgact tccaactgga agcttttcga tgcgcgtggg 720 
caggtaccag gagacatctg tatccaggtc ggtgggcgct atttgcagac cgccttcgcg 780 
gcgacccgct atgccggcac cgaggcggta ctgtgcgtat tcaacgacat cacggtccac 840 
tgcgaggcgg agaccgcgct gtccaatgcg aagcgagcag cggatgccgc cagccaggcc 900 
aagaccctgt tcctggcccg catgagccat gaaatccgta ctcccctgta cggtgtcctt 960 
ggcaccctgg agttgctcga cctgaccacc ctgaacgagc ggcaacgcgc ctacctacgc 1020 
accatccaga gttcgtctgc gacgctcatg caactgatta gcgatgtgct ggatgtctcg 1080 
aagatcgaag cggggcagat ggctctgacc ctggccgcct tcaatccgct ggacctagtg' 1140 
cgggaagtgc ttggcaactt tgccgccagc gccatggcca aggacctgca ggtagacccg 1200 
ctcgatactc ttgcgcttga ggcgcaggtc gcgcatggct tcgaagaaag cgttctgttc 1260 
gaggttgctg gtggctcggt cggccatttc gaagagggtg tcgtcggcgt tgtcgaacaa 1320 
cgcctgcaac gcctgtttca gctgcagcgc cgccttgtcg cgcacctgca cgaggatgac 1380 
cggcaggcgc cccgctccgg cgttcggcga cggctcggaa gcgaccctgg tcaggtgcac 1440 
cacattggca tcgttctgca tcgggactct cctgccaccc tcgcggccgc gcatggaatg 1500 
gcaaaaatcg ggcacagagg atcgattggc gtcgtccgta acgtcaattt ccaggcgtca 1560 
aaaacaagta tctacattca ttatagagat actttcaaat ctagatag 1608 
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Figure 7D 

ORF3-5 

SEQ ID NO: 33 

atgttgggag tcgcccgggt tgatctgccg gaaacactct ggtggcacga cgagccgaac 60 
catctgatca tcgctgcgag cctgcttgat ctcaggcgaa tcaatgactt cgaacagttg 120 
gttgagcgcc cggcattcga ttcgtacagc ctggtatcgc cggatggcga ggtattgctc 180 
qqcgcggccc ctgcgaccgg cctgagggat ggcctgaacc tcacccgaca gggggtcgcc .240 
gttcaactgc gcagccagcc tgagaacggc .tggctcgcgg tctaccgaac cgactacggc 300 
aatttctttc gccactcccg gtggctggtg gcaggtctgc tgctgacccc ggcgctgctc 360 
ctggccggtt ggctcgggat gcgttggtac accagcagcg tcgtcaaccc ggtgcatcgg 420 
gcgcaccggc aactggtgga gagcgacacc ttcagccgga cgctgataca gaccgcgccg 4 80 , 
gtggctctgg tggtgctgac ccaggatgac cagcaactgg tgacctgcaa ccacttggcc 540 
gcccagtggc tgggcgggcc cacggagatc cttgggctga cttccaactg gaagcttttc 600 
gatgcgcgtg ggcaggtacc aggagacatc tgtatccagg tcggtgggcg ctatttgcag 660 
accgccttcg cggcgacccg ctatgccggc accgaggcgg tactgtgcgt attcaacgac 720 
atcacggtcc actgcgaggc ggagaccgcg- ctgtccaatg cgaagcgagc agcggatgcc 780 
gccagccagg ccaagaccct gttcctggcc cgcatgagcc atgaaatccg tactcccctg 840 
tacggtgtcc ttggcaccct ggagttgctc gacctgacca ccctgaacga gcggcaacgc 900 
gcctacctac gcaccatcca gagttcgtct gcgacgctca tgcaactgat tagcgatgtg 960 
ctggatgtct cgaagatcga agcggggcag atggctctga ccctggccgc cttcaatccg 1020 
ctggacctag tgcgggaagt gcttggcaac tttgccgcca gcgccatggc caaggacctg 1080 
caggtagacc cgctcgatac tcttgcgctt gaggcgcagg tcgcgcatgg cttcgaagaa 1140 
agcgttctgt tcgaggttgc tggtggctcg gtcggccatt tcgaagaggg tgtcgtcggc 1200 
gttgtcgaac aacgcctgca acgcctgttt cagctgcagc gccgccttgt cgcgcacctg 1260 
cacgaggatg accgg.caggc gccccgctcc ggcgttcggc gacggctcgg aagcgaccct 1320 
ggtcaggtgc accacattgg catcgttctg catcgggact ctcctgccac cctcgcggcc 1380 
gcgcatggaa tggcaaaaat cgggcacaga ggatcgattg gcgtqgtccg taacgtcaat 1440 
ttccaggcgt caaaaacaag tatctacatt cattatagag atactttcaa atctagatag 1500 
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ORF3-6 

SEQ ID NO: 34 

atgcgttggt acaccagcag cgtcgtcaac 
gagagcgaca ccttcagccg gacgctgata 
acccaggatg accagcaact ggtgacctgc 
cccacggaga tccttgggct gacttccaac 
ccaggagaca tctgtatcca ggtcggtggg 
cgctatgccg gcaccgaggc ggtactgtgc 
gcggagaccg cgctgtccaa tgcgaagcga 
ctgttcctgg cccgcatgag ccatgaaatc 
ctggagttgc tcgacctgac caccctgaac 
cagagttcgt ctgcgacgct catgcaactg' 
gaagcggggc agatggctct gaccctggcc 
gtgcttggca actttgccgc cagcgccatg 
actcttgcgc ttgaggcgca ggtcgcgcat 
gctggtggct cggtcggcca tttcgaagag 
caacgcctgt ttcagctgca gcgccgcctt 
gcgccccgct ccggcgttcg gcgacggctc 
ggcatcgttc tgcatcggga ctctcctgcc 
atcgggcaca gaggatcgat tggcgtcgtc 
agtatctaca ttcattatag agatactttc 
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Figure 7E 



ccggtgcatc gggcgcaccg gcaactggtg 60 
cagaccgcgc cggtggctct ggtggtgctg 120 
aaccacttgg ccgcccagtg gctgggcggg 180 
tggaagcttt tcgatgcgcg tgggcaggta .240 
cgctatttgc agaccgcctt cgcggcgacc 300 
gtattcaacg acatcacggt ccactgcgag 360 
gcagcggatg ccgccagcca ggccaagacc 420 
cgtactcccc tgtacggtgt ccttggcacc 4 80 
gagcggcaac gcgcctacct acgcaccatc 540 
attagcgatg tgctggatgt ctcgaagatc 600 
gccttcaatc cgctggacct agtgcgggaa 660 
gccaaggacc tgcaggtaga cccgctcgat 720 
ggcttcgaag aaagcgttct gttcgaggtt 780 
ggtgtcgtcg gcgttgtcga acaacgcctg 84 0 
gtcgcgcacc tgcacgagga tgaccggcag 900 
ggaagcgacc ctggtcaggt gcaccacatt 960 
accctcgcgg ccgcgcatgg aatggcaaaa- 1020 
cgtaacgtca atttccaggc gtcaaaaaca 1080 
aaatctagat ag 1122 
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Figure 7F 

ORF3-2 

SEQ ID NO: 35 

Met Asp Val He Arg Glu His Glu Val Phe Leu Gly Arg He Ala Arg 

x 5 10 15 . 

Lys Ser Asp Lys Thr Thr Gin Lys Tyr Asp Tyr Asp Val Val Pro Leu 

20 25 30 

Gin Arg' His Leu Leu Ala Lys Glu Asn Gly Leu Ala Val Tyr Glu Gly. 

35 40 45 

Arg Glu Phe Ser Phe Ala Met Pro Phe Leu Leu Ala Thr Lys His Ala 

50 55 60 

Leu Ser Ala Asp Ser Ser Gly Asp Pro Phe Ser Leu Gly Val Leu Leu • 
65 70 75 80 

Ala Asn Phe Tyr Gly Ser Phe Trp Ser Val Ser Ala Tyr Pro Ala Pro 

85 90 95 

Gin Leu Leu He Phe Asp Leu Ser Gly Ser Thr Arg Leu Ala Val Pro 

100 105 HO 

Ser He Pro Ser Thr Ala Gin Arg Asp Arg Leu Ser Gly Ser Tyr Pro 

115 120 125 

Met He Val Glu Arg He Leu Ala Arg Leu Arg Thr Arg Pro Val Gly 

130 135 140 

Glu Asp Ala Gin Arg Val His Trp He Arg Ala Asp Arg Tyr Arg Asp 
145 150 155 160 

Ser Ala Leu Glu Met Leu Gly Val Ala Arg Val Asp Leu Pro Glu Thr 

165 170 175 

Leu Trp Trp His Asp Glu Pro Asn His Leu He He Ala Ala Ser Leu 

180 185 190 

Leu Asp Leu Arg Arg He Asn Asp Phe Glu Gin Leu Val Glu Arg Pro 

195 200 205 

Ala Phe Asp Ser Tyr Ser Leu Val Ser Pro Asp Gly Glu Val Leu Leu 

'210 215 220 

Gly Ala Ala Pro Ala Thr Gly Leu Arg Asp Gly Leu Asn Leu Thr Arg. 
225 230 235 240 

Gin Gly Val Ala Val Gin Leu Arg Ser Gin Pro Glu Asn Gly Trp Leu 

245 250 255 

Ala Val Tyr Arg Thr Asp Tyr Gly Asn Phe Phe Arg His Ser Arg. Trp 

260 265 270 

Leu Val Ala Gly Leu Leu Leu Thr Pro Ala Leu Leu Leu Ala Gly Trp 

275 280 285 

Leu Gly Met Arg Trp Tyr Thr Ser Ser Val Val Asn Pro Val His Arg 

290 295 300 

Ala His Arg Gin Leu Val Glu Ser Asp Thr Phe Ser Arg Thr Leu He 
305 " 310 315 320 

Gin Thr Ala Pro Val Ala Leu Val Val Leu Thr Gin Asp Asp Gin Gin 

' 325 330 - 335 

Leu Val Thr Cys Asn His Leu Ala Ala Gin Trp Leu Gly Gly Pro Thr 

340 345 350 

Glu lie Leu Gly Leu Thr Ser Asn Trp Lys Leu Phe Asp Ala Arg Gly 

355 360 365 

Gin Val Pro Gly Asp He Cys He Gin Val Gly Gly Arg Tyr Leu Gin 

370 375 380 

Thr Ala Phe Ala Ala Thr Arg Tyr Ala Gly Thr Glu Ala Val Leu Cys 
385 390 ■ 395 400 

Val Phe Asn Asp He Thr Val His Cys Glu Ala Glu Thr Ala Leu Ser 

405 410 415 

Asn Ala Lys Arg Ala Ala Asp Ala Ala Ser Gin Ala Lys Thr Leu Phe 

420 425 430 

Leu Ala Arg Met Ser His Glu He Arg Thr Pro Leu Tyr Gly Val Leu 

435 440 445 

Gly Thr Leu Glu Leu Leu Asp Leu Thr Thr Leu Asn Glu Arg Gin Arg 
450 455 460 
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Ala 


Tyr 


ilCU 


21tvt 

Ar 9 


Thr 


He 


Gin 


Ser 


Ser 


Ser Ala Thr Leu Met Gin Leu 


465 
















475 480 


lie 




T\ en 


v d j. 


Leu 


Asp 


Val 


Ser 


Lys 


He Glu Ala Gly Gin Met Ala 






A P. ^ 










490 495 


Leu 


inr 


JjCU 


Ala 


A"l a 


Phe 


Asn 


Pro 


Leu 


Asp Leu Val Arg Glu Val Leu 






DUO 












510 


Gly 


Asn 


Phe 


al a 

nld 


Al a 


OCX 


Ala 


Met 


Ala 


Lys Asp Leu Gin Val Asp Pro 




515 










520 




525 


Leu 


Asp 




ucu 


Ala 


Leu 


Glu 


Ala 


Gin 


Val Ala His Gly Phe Glu Glu 


530 










535 






540 


Ser 


Val 


Leu 


rue 


Pi «1 

ulu 


Val 
Val 


Ala Gly 




c pr val Glv His Phe Glu Glu 


545 










550 








RCc, 560 


Gly 


Val 


Val 


oiy 


val 


Val 


Glu 


Gin 




Leu Gin Ara Leu Phe Gin Leu 






rrr 
DOO 










570 575 


Gin 


Arg 


Arg 


Leu 


val 




His 


Leu 


nib 


olu Asrj Astd Ara Gin Ala Pro 




con 










ODD 


590 


Arg 


Car 
OCX 




Val 


Area 




Arg Leu 


Glv 


Ser Asp Pro Gly Gin Val His 




595 








600 




605 


His 


He 


Gly 


He 


Val 


Leu 


His 


Arg 


Asp 


Ser Pro Ala Thr Leu Ala Ala 




610 








615 






620 


Ala 


His 


Gly 


Met 


Ala 


Lys 


He Gly 


His 


Arg Gly Ser He Gly Val Val 


625 








630 








635 640 


Arg 


Asn 


Val 


Asn 


Phe 


Gin 


Ala 


Ser 


Lys 


Thr Ser He Tyr He His Tyr 








645 










650 655 


Arg 


Asp 


Thr 


Phe 


Lys 


Ser 


Arg 












660 
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Figure 7G 

ORF 3-3 

SEQ ID NO: 36 

Met Pro Phe Leu Leu Ala Thr Lys His Ala Leu Ser Ala Asp Ser Ser 

! 5 10 15 

Gly Asp Pro Phe Ser Leu Gly Val Leu Leu Ala Asn Phe Tyr Gly Ser 

20 25 30 

Phe Trp Ser Val Ser Ala Tyr Pro Ala Pro Gin Leu Leu lie Phe Asp 

35 40 45 

Leu Ser Gly Ser Thr Arg Leu Ala Val Pro Ser He Pro Ser Thr Ala 

50 55 60 

Gin Arg Asp Arg Leu Ser Gly Ser Tyr Pro Met He Val Glu Arg He 
65 ^ 70 75 80 

Leu Ala Arg Leu Arg Thr Arg Pro Val Gly Glu Asp Ala Gin Arg Val 

85 90 95 

His Trp He Arg Ala Asp Arg Tyr Arg Asp Ser Ala Leu Glu Met Leu 

100 105 HO 

Gly Val Ala Arg Val Asp Leu Pro Glu Thr Leu Trp Trp His Asp Glu 

115 120 ' 125 

Pro Asn His Leu He He Ala Ala Ser Leu Leu Asp Leu Arg Arg He 

130 135 140 

Asn Asp Phe Glu Gin Leu Val Glu Arg Pro Ala Phe Asp Ser Tyr Ser 
145 150 155 160 

Leu Val Ser Pro Asp Gly Glu Val Leu Leu Gly Ala Ala Pro Ala Thr 

165 170 175 

Gly Leu Arg Asp Gly Leu Asn Leu Thr Arg Gin Gly Val Ala Val Gin 

180 185 190 

Leu Arg Ser Gin Pro Glu Asn Gly Trp Leu Ala Val Tyr Arg Thr Asp 

195 200 205 

Tyr Gly Asn Phe Phe Arg His Ser Arg Trp Leu Val Ala Gly Leu Leu 

210 215 220 

Leu Thr Pro Ala Leu Leu Leu Ala Gly Trp Leu Gly Met Arg Trp Tyr 
225 230 235 240 

Thr Ser Ser Val Val Asn Pro Val His Arg Ala His Arg Gin Leu Val 

245 250 255 

Glu Ser Asp Thr Phe Ser Arg Thr Leu He Gin Thr Ala Pro Val Ala 

260 265 270 

Leu Val Val Leu Thr Gin Asp Asp Gin Gin Leu Val Thr Cys Asn His 

275 280 285 

Leu Ala Ala Gin Trp Leu Gly Gly Pro Thr Glu He Leu Gly Leu Thr 

290 295 300 

Ser Asn Trp Lys Leu Phe Asp Ala Arg Gly Gin Val Pro Gly Asp He 
305 310 315 320 

Cys He Gin Val Gly Gly Arg Tyr Leu Gin Thr Ala Phe Ala Ala Thr 

325 330 335 

Arg Tyr Ala Gly Thr Glu Ala Val Leu Cys Val Phe Asn Asp He Thr 

340 345 350 

Val His Cys Glu Ala Glu Thr Ala Leu Ser Asn Ala Lys Arg Ala Ala 

355 360 365 

Asp Ala Ala Ser Gin Ala Lys Thr Leu Phe Leu Ala Arg Met Ser His 

370 375 380 

Glu He Arg Thr Pro Leu Tyr Gly Val Leu Gly Thr Leu Glu Leu Leu 
385 390 ** 395 400 

Asp Leu Thr Thr Leu Asn Glu Arg Gin Arg Ala Tyr Leu Arg Thr He 

405 410 - 415 

Gin Ser Ser Ser Ala Thr Leu Met Gin Leu He Ser Asp Val Leu Asp 

420 425 430 

Val Ser Lys He Glu Ala Gly Gin Met Ala Leu Thr Leu Ala Ala Phe 

435 440 445 

Asn Pro Leu Asp Leu Val Arg Glu Val Leu Gly Asn Phe Ala Ala Ser 
450 455 460 
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Ala Met Ala Lys Asp Leu Gin Val Asp Pro Leu Asp Thr Leu Ala Leu 
465 * 470 475 480 

Glu Ala Gin Val Ala His Gly Phe Glu Glu Ser Val Leu Phe Glu Val 

485 490 495 

Ala Gly Gly Ser Val Gly His Phe Glu Glu Gly Val Val Gly Val Val 

500 ' 505 510 

Glu Gin Arg Leu Gin Arg Leu Phe Gin Leu Gin Arg Arg Leu Val Ala 

515 520 525 

His Leu His Glu Asp Asp . Arg Gin Ala Pro Arg Ser Gly Val Arg Arg 

530 535 540 

Arg Leu Gly Ser Asp Pro Gly Gin Val His His lie Gly He Val Leu 
545 550 555 560 

His Arg Asp Ser Pro Ala Thr Leu Ala Ala Ala His Gly Met Ala Lys 

565 570, 575 

He Gly His Arg Gly Ser He Gly Val Val Arg Asn Val Asn Phe Gin 

580 585 590 

Ala Ser Lys Thr Ser He Tyr He His Tyr Arg Asp Thr Phe Lys Ser 
595 600 605 

Arg 
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Figure 7H 

ORF3-4 

SEQ ID NO: 37 

Met He Val Glu Arg He Leu Ala Arg Leu Arg Thr Arg Pro Val Gly 

1 5 10 15 

Glu Asp Ala Gin Arg Val His Trp He Arg Ala Asp Arg Tyr Arg Asp 

20 25 30 

Ser Ala Leu Glu Met Leu Gly Val Ala Arg Val Asp Leu. Pro Glu Thr 

35 40 ■ 45 

Leu Trp Trp His Asp Glu Pro Asn His Leu He He Ala Ala Ser Leu 

50 55 60- 

Leu Asp Leu Arg Arg He Asn Asp Phe Glu Gin Leu Val Glu Arg Pro 
» 65 70 75 80 

Ala Phe Asp Ser Tyr Ser Leu Val Ser Pro Asp Gly Glu Val Leu Leu 

85 90 55 

Gly Ala Ala Pro Ala Thr Gly Leu Arg Asp Gly Leu Asn Leu Thr Arg 

• ioo 105 HO 

Gin Gly Val Ala Val Gin Leu Arg Ser Gin Pro Glu Asn Gly Trp Leu 

115 . 120 125 

Ala Val Tyr Arg Thr Asp Tyr Gly Asn Phe Phe Arg His Ser Arg Trp 

130 135 140 

Leu Val Ala Gly Leu Leu Leu Thr Pro Ala Leu Leu Leu Ala Gly Trp 
145 150 155 160 

Leu Gly Met Arg Trp Tyr Thr Ser Ser Val Val Asn Pro Val His Arg 

165 170 175 

Ala His Arg Gin Leu Val Glu Ser Asp Thr Phe Ser Arg Thr Leu lie 

180 185 190 

Gin Thr Ala Pro Val Ala Leu Val Val Leu Thr Gin Asp Asp Gin Gin 

195 200 205 

Leu Val Thr Cys Asn His Leu Ala Ala Gin Trp Leu Gly Gly Pro Thr 

210 215 220 

Glu He Leu Gly Leu Thr Ser Asn Trp Lys Leu Phe Asp Ala Arg Gly 
225 230 235 240 

Gin Val Pro Gly Asp He Cys He Gin Val Gly Gly Arg Tyr Leu Gin 

245 250 255 

Thr Ala Phe Ala Ala Thr Arg Tyr Ala Gly Thr Glu Ala Val Leu Cys 

260 265 270 

Val Phe Asn Asp lie Thr Val His Cys Glu Ala Glu Thr Ala Leu Ser 

275 280 285 

Asn Ala Lys Arg Ala Ala Asp Ala Ala Ser Gin Ala Lys Thr Leu Phe 

290 295 300 

Leu Ala Arg Met Ser His Glu He Arg Thr Pro Leu Tyr Gly Val -Leu 
305 310 315 320 

Gly Thr Leu Glu Leu Leu Asp Leu Thr Thr Leu Asn Glu Arg Gin Arg 

325 330 335 

Ala Tyr Leu Arg Thr He Gin Ser Ser Ser Ala Thr" Leu Met Gin Leu 

340 345 350 

lie Ser Asp Val Leu Asp Val Ser Lys He Glu Ala Gly Gin Met Ala 

355 360 365 

Leu Thr Leu Ala Ala Phe Asn Pro Leu Asp Leu Val Arg Glu .Val Leu 

370 375 380 

Gly Asn Phe Ala Ala Ser Ala Met Ala Lys Asp Leu Gin Val Asp Pro 
385 390 395 400 

Leu Asp Thr Leu Ala Leu Glu Ala Gin Val Ala His Gly Phe Glu Glu 

405 410 415 ■ 

Ser Val Leu Phe Glu Val Ala Gly Gly Ser Val Gly His Phe Glu Glu 

420 425 430 

Gly Val Val Gly Val Val Glu Gin Arg Leu Gin Arg Leu Phe Gin Leu 

435 440 . 445 

Gin Arg Arg Leu Val Ala His Leu His Glu Asp Asp Arg Gin Ala Pro 
450 455 460 
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Arcr Ser Gly Val Arg Arg Arg Leu Gly Ser Asp Pro Gly Gin Val His 
465 470 475 480 

His He Gly He Val Leu His Arg Asp Ser Pro Ala Thr Leu Ala Ala 

485 490 495 

Ala His Gly Met Ala Lys He Gly His Arg Gly Ser He Gly Val Val 

500 ' 505 510 

Arg Asn Val Asn Phe Gin Ala Ser Lys Thr Ser He Tyr He His Tyr 

515 520 525 

Arg Asp Thr Phe Lys Ser Arg 
530 535 
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Figure 71 

ORF3-5 

SEQ" ID NO: 38 

Met Leu Qly Val Ala Arg Val Asp Leu Pro Glu Thr Leu Trp Trp His 

! 5 10 15 

Asp Glu Pro Asn His Leu He He Ala Ala Ser Leu Leu Asp Leu Arg 

20 25 30 

Arg He Asn Asp Phe Glu Gin Leu Val Glu Arg Pro Ala Phe Asp Ser 

35 40 45 

Tyr Ser Leu Val Ser Pro Asp Gly Glu Val Leu Leu Gly Ala Ala Pro 

50 55 60 

Ala Thr Gly Leu Arg Asp Gly Leu Asn Leu Thr Arg Gin Gly Val Ala 
65 " 70 75 80 

Val Gin Leu Arg Ser Gin Pro Glu Asn Gly Trp Leu Ala Val Tyr Arg 

85 90 95 

Thr Asp Tyr Gly Asn Phe Phe Arg His Ser Arg Trp Leu Val Ala Gly 

100 105 HO 

Leu Leu Leu Thr Pro Ala Leu Leu Leu Ala Gly Trp Leu Gly Met Arg 

115 120 125 

Trp Tyr Thr Ser Ser Val Val Asn Pro Val His Arg Ala His Arg Gin 

130 135 140 

Leu Val Glu Ser Asp Thr Phe Ser Arg Thr Leu He Gin Thr Ala Pro 
145 150 155 160 

Val Ala Leu Val Val Leu Thr Gin Asp Asp Gin Gin Leu Val Thr Cys 

165 170 175 

Asn His Leu Ala Ala Gin Trp Leu Gly Gly Pro Thr Glu He Leu Gly 

180 185 190 

Leu Thr Ser Asn Trp Lys Leu Phe Asp Ala Arg Gly Gin Val Pro Gly 

195 200 , 205 

Asp He Cys He Gin Val Gly Gly Arg Tyr Leu Gin Thr Ala Phe Ala 

210 215 220 

Ala Thr Arg Tyr Ala Gly Thr Glu Ala Val Leu Cys Val Phe Asn Asp 
225 230 235 240 

He Thr Val His Cys Glu Ala Glu Thr Ala Leu Ser Asn Ala Lys Arg 

245 250 255 

Ala Ala Asp Ala Ala Ser Gin Ala Lys Thr Leu Phe Leu Ala Arg Met 

260 ' 265 270 

Ser His Glu He Arg Thr Pro Leu Tyr Gly Val Leu Gly Thr Leu Glu 

275 280 285 

Leu Leu Asp Leu Thr Thr Leu Asn Glu Arg Gin Arg Ala Tyr Leu Arg 

290 295 300 

Thr He Gin Ser Ser Ser Ala Thr Leu Met Gin Leu He Ser Asp Val 
305 310 315 320 

Leu Asp Val Ser Lys He Glu Ala Gly Gin Met Ala Leu Thr Leu Ala 

325 330 335 

Ala Phe Asn Pro Leu Asp Leu Val Arg Glu Val Leu Gly Asn Phe Ala 

340 345 350 

Ala Ser Ala Met Ala Lys Asp Leu Gin Val Asp Pro Leu Asp Thr Leu 

355 360 365 

Ala Leu Glu Ala Gin Val Ala His Gly Phe Glu Glu Ser Val Leu Phe 

370 375 380 

Glu Val Ala Gly Gly Ser Val Gly His Phe Glu Glu Gly Val Val Gly 
385 390 395 400 

Val Val Glu Gin Arg Leu Gin Arg Leu Phe Gin Leu Gin Arg Arg Leu i 

405 410 415 

Val Ala His Leu His Glu Asp Asp Arg Gin Ala Pro Arg Ser Gly Val 

420 425 430 

Arg Arg Arg Leu Gly Ser Asp Pro Gly Gin Val His His He Gly He 

435 440 445 

Val Leu His Arg Asp Ser Pro Ala Thr Leu Ala Ala Ala His Gly Met 
450 455 460 
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Ala Lvs He Gly His Arg Gly Ser He Gly Val Val Arg Asn Val Asn 
465 Y 470 475 480 

Phe Gin Ala Ser Lys Thr Ser He Tyr He His Tyr Arg Asp Thr Phe 
485 490 495 

Lys Ser Arg 
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Figure. 7J 

0RF3-6 

SEQ ID NO : 3 3 

Met Arg Trp Tyr Thr Ser Ser Val Val Asn Pro Val His Arg Ala His 

1 5 10 15 . 

Arq Gin Leu Val Glu Ser Asp Thr Phe Ser Arg Thr Leu He Gin Thr 

. 20 25 30 

Ala Pro Val Ala Leu Val Val Leu Thr Gin Asp Asp Gin Gin Leu Val 

35 40 45 

Thr Cys Asn His Leu Ala Ala Gin Trp Leu Gly Gly Pro Thr Glu lie 

50 55 60 

Leu Glv Leu Thr Ser Asn Trp Lys Leu Phe Asp Ala Arg Gly Gin Val 
65 . 70 75 80 

Pro Gly Asp lie Cys lie Gin Val Gly Gly Arg Tyr Leu Gin Thr Ala 

85 90 95 

Phe Ala Ala Thr Arg Tyr Ala Gly Thr Glu Ala Val Leu Cys Val Phe 

100 105 HO 

Asn Asp He Thr Val His Cys Glu Ala Glu Thr Ala Leu Ser Asn Ala 

115 120 125 

Lys Arg Ala Ala Asp Ala Ala Ser Gin Ala Lys Thr Leu Phe Leu Ala 

130 135 140 

Ara Met Ser His Glu He Arg Thr Pro Leu Tyr Gly Val Leu Gly Thr 
14 5 150 155 160 

Leu Glu Leu Leu Asp Leu Thr Thr Leu Asn Glu Arg Gin Arg Ala Tyr 

165 170 175 

Leu Arg Thr He Gin Ser Ser Ser Ala Thr Leu Met Gin Leu He Ser 

180 185 190 

Asp Val Leu Asp Val Ser Lys He Glu Ala Gly Gin Met Ala Leu Thr 

195 200 205 

Leu Ala Ala Phe Asn Pro Leu Asp Leu Val Arg Glu Val Leu Gly Asn 

210 215 220 

Phe Ala Ala Ser Ala Met Ala Lys Asp Leu Gin Val Asp Pro Leu Asp 
225 230 235 240 

Thr Leu Ala Leu Glu Ala Gin Val Ala His Gly Phe Glu Glu Ser Val 

245 250 255 

Leu Phe Glu Val Ala Gly Gly Ser Val Gly His Phe Glu Glu Gly Val 

260 265 270 

Val Gly Val Val Glu Gin Arg Leu Gin Arg Leu Phe Gin Leu Gin Arg 

275 280- 285 

Arg Leu Val Ala His Leu His Glu Asp Asp Arg Gin Ala Pro Arg Ser 

290 295 300 

Gly Val Arg Arg Arg Leu Gly Ser Asp Pro Gly Gin Val His His He 
305 310 315 320 

Gly He Val Leu His Arg Asp Ser Pro Ala Thr Leu Ala Ala Ala His 

325 330 335 

Gly Met Ala Lys He Gly His Arg Gly Ser He Gly Val Val Arg Asn 

340 345 350 

Val Asn Phe Gin Ala Ser Lys Thr Ser lie Tyr He His Tyr Arg Asp 

355 360 365 

Thr Phe Lys Ser Arg 
370 
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SEQUENCE LISTING 

<110> The General Hospital Corporation 

<120> Regulators of Biofilm Formation and Uses 
Thereof 

<130> 00786/39SWO4 

<150> US 60/373,233 
<151> 2002-04-16 

<150> US 60/303,286 
<151> 2001-07-06 

<160> 39 

<170> FastSEQ for Windows Version 4,0 

<210> 1 
<211> 1200 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 1 

atgagctgga aatcctatcg ggtgctggtg gtcgaagatc agccgtttca gcgcgaatac 60 
ctgctcaacc tgtttcgcga gcgcggcgtg cagtacctgg taggtgccgg cgacggcgcg 120 
gaggcgttgc gctgcctgaa gcaggacagg ttcgacctga tcctcagcga tctgatgatg 180 
ccgggcatgg atggtatcca aatgatcctg caactgccgt atctcaagca tcgtccgaag 240 
ctggcgctga tgagctcctc gtcgcagcgg atgatgctca gtgccagccg ggtcgcccag 300 
agtctcggct tgtcggtaat cgacctgttg cccaagccga ctctgcccaa ggccatcggc 360 
caacttctgg aacacctgga aagatgcctc aggcagaagc tggagccgga aaccgacgag 420 
actccgcatg ggcgcacggc gttgctggat gccctgcata acgagcaact ggtgacctgg 4 80 
ttccaggcta agaaatccct ccacaccggg cgcatagtcg gcgccgaggc gttgatacgc 540 
tggagccacc cgcagcatgg cctgttgctg cccagctgtt tcatgagtga tgtcgacgct 600 
accggtctgc acgaggcgtt gctctggcgc gtgctcgaac agaccctgaa cgcccaggaa 660 
tcgtggcgca gggcgggtta cgagattccg gtttcggtga atctgccgcc gcacctgctc 720 
gataaccagg aacttccgga tcgactctat gagtacgtcg gcgctcgcgg ggcttgtacc 780 
agctcactat gtttcgagtt gaccgagagc agtgtcacaa ctctgtcaag taactactat 840 
gcaggtgcct gtcgcttgcg catgaaaggg ttcggattgg cccaggacga ctttggccag 900 
ggttacagct cgttctataa cctggtcacg acgcctttca cggagctgaa gatcgaccgc 960 
tccctagtcc agggatgcgt agaggataac ggcctcaatg cagctgtcat cagttgtatt 1020 
gagttgggtc accgcctgaa tctcgacgtg gtggccgaag gcgtggagac ctgcgaggaa 1080 
ctgaatcttc ttcgtcgtct tggctgcgac cgggcgcagg gtttcctgat ttctaaggca 1140 
gtgtctgctc gtgagttcga gcggcagtta agggaggacg gccccagcct ccttgtttaa 1200 

<210> 2 
<211> 399 
<212> PRT 

<213> Pseudomonas aeruginosa PA14 
<400> 2 

Met Ser.Trp Lys Ser Tyr Arg Val Leu Val Val Glu Asp Gin Pro Phe 

1 5 10 15 

Gin Arg Glu Tyr Leu Leu Asn Leu Phe Arg Glu Arg Gly Val Gin Tyr 
20 25 30 
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Leu 


Val 


Gly Ala 


Gly Asp 


Gly 


Ala 






35 






40 


Asp 


Arg 


Phe Asp 


Leu lie 


Leu 


Ser 




50 






55 




Gly 


lie 


Gin Met 


lie Leu 


Gin 


Leu 


65 






70 






Leu 


Ala 


Leu Met 


Ser Ser 


Ser 


Ser 








85 






Arg 


Val 


Ala Gin 


Ser Leu 


Gly 


Leu 




100 








Pro 


Thr 


Leu Pro 


Lys Ala 


lie 


Gly 






115 






120 


Cys 


Leu 


Arg Gin 


Lys Leu 


Glu 


Pro 




130 






135 




Arg 


Thr 


Ala Leu 


Leu Asp 


Ala 


Leu 


145 






150 






Phe 


Gin 


Ala Lys 


Lys Ser 


Leu 


His 








165 






Ala 


Leu 


lie Arg 


Trp Ser 


His 


Pro 






180 








Cys 


Phe 


Met Ser 


Asp Val 


Asp 


Ala 






.195 






200 


Trp 


Arg 


Val Leu 


Glu Gin 


Thr 


Leu 




210 






215 




Ala 


Gly 


Tyr Glu 


lie Pro 


Val 


Ser 


225 






230 






Asp 


Asn 


Gin Glu 


Leu Pro 


Asp 


Arg 








245 






Gly 


Ala 


Cys Thr 


Ser Ser 


Leu 


Cys 




260 








Thr 


Thr 


Leu Ser 


Ser Asn 


Tyr 


Tyr 






275 






280 


Lys 


Gly 


Phe Gly 


Leu Ala 


Gin 


Asp 




290 






295 




Phe 


Tyr 


Asn Leu 


Val Thr 


Thr 


Pro 


305 






310 






Ser 


Leu 


Val Gin 


Gly Cys 


Val 


Glu 








325 






lie 


Ser 


Cys lie 


Glu Leu 


Gly 


His 






340 








Glu 


Gly 


Val Glu 


Thr Cys 


Glu 


Glu 




355 






360 


Cys 


Asp 


Arg Ala 


Gin Gly 


Phe 


Leu 




370 






375 




Glu 


Phe 


Glu Arg 


Gin Leu 


Arg 


Glu 


385 






390 







Glu 


Ala Leu Arg 


Cys 


Leu 


Lys 


Gin 






45 








Asp 


Leu Met Met 


Pro 


Gly 


Met 


Asp 




60 










Pro 


Tyr Leu Lys 


His 


Arg 


Pro 


Lys 




75 








80 


Gin 


Arg Met Met 


Leu 


Ser 


Ala 


Ser 




90 






95 




Ser 


Val He Asp 


Leu 


Leu 


Pro 


Lys 


105 






110 






Gin 


Leu Leu Glu 


His 


Leu 


Glu 


Arg 






125 






Gly 


Glu 


Thr Asp Glu 


Thr 


Pro 


His 




140 










His 


Asn Glu Gin 


Leu 


Val 


Thr 


Trp 




155 








160 


Thr 


Gly Arg He 


Val 


Gly 


Ala 


Glu 




170 






175 




Gin 


His Gly Leu 


Leu 


Leu 


Pro 


Ser 


185 






190 






Thr 


Gly Leu His 


Glu 


Ala 


Leu 


Leu 






205* 








Asn 


Ala Gin Glu 


Ser Trp 


Arg 


Arg 




220 










Val 


Asn Leu Pro 


Pro 


His 


Leu 


Leu 




235 








240 


Leu 


Tyr Glu Tyr 


Val 


Gly 


Ala 


Arg 




250 






255 




Phe 


Glu Leu Thr 


Glu 


Ser 


Ser 


Val 


265 






270 






Ala 


Gly Ala Cys 


Arg Leu 


Arg 


Met 






285 








Asp 


Phe Gly Gin 


Gly Tyr 


Ser 


Ser 




300 










Phe 


Thr Glu Leu 


Lys 


He 


Asp 


Arg 




315 








320 


Asp 


Asn Gly Leu 


Asn 


Ala 


Ala 


vai 




330 






335 




Arg 


Leu Asn Leu 


Asp Val 


Val 


Ala 


345 






350 






Leu 


Asn Leu Leu 


Arg Arg 


Leu 


Gly 






365 








He 


Ser Lys Ala- 


Val 


Ser 


Ala 


Arg 




380 










Asp 


Gly Pro Ser 


Leu 


Leu 


Val 





395 



<210> 3 
<211> 1416 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 3 

atgagccatg aaatccgcac acctctgtac ggcatgcttg gcacgcttga gctgcttggg 60 
cgtaccgagc tgagtcggca gcaggccggt tacctaaagg caatccagca ttcctcgtcg 120 
accctgctgc aactgatcag cgatgtgctt gacgtatcca agatagaggc cggccaactg 180 
gacctagagt gcgtggaatt ctccccgctg gaattgaccg aagaggtcgt gcagtcgttc 240 
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accggtgccg cgcaggccaa ggggctgcag ttgtatacct gcctctctgc ggagctgccg 300 
ctgcgcatgc ggggggccgc ggcgtcgatc cggcagattc tcaacaacct gctgagcaac 360 
gcggtgaagt tcaccgacaa tggctatgtc aacgtccacc tgaaggccag cgtggtcgat 420 
gccgaatgtg tgatgctgac ctggcaggtc aacgataccg gcatggggat caacgtcgag 480 
gatcagccgc gtctgttcga accgttctac cagatacgcc gctccgagca tccggtcgca 540. 
ggcacgggcc tcggcttgtc gatcagccag cgcctggcgc agctaatgaa tggcagtctg 600 
aaactggtca gtgagctggg gttgggcagc agctttagcc tcaggcttcc gcttgagcgg 660 
atcgcgatgc aggctgagcc gcaggaccta gccgggtgcg ccgtccaagt gctggcgcct 720 
gtccgcgacc taacggaatg cctgtgtggc tggatctccc gctggggtgg aagggccatg 780 
gtcgcgacgc cgaggtcgct ggacgaggcg gacgcgacct cgctgctggt cgaagtgtta 840 
ctgctggagg gggcgccgat gttcgaagca tggccaggat gccgggtgga gctttcccct 900 
cagggtgata tggagccgca ggcacagggc cgcgactggc tgctcgggct caacaacctg 960 
gacggcctgc atcgtgctct gggcctggcc catgggcgtc tcgctgatcc ttcgacgccg 1020 
ccgatacggc tggctccgtt gcgcaatcta ggtctccgcg tcctagtggt ggaggataac 1080 
gcgatcaacc agttgatctt gagggaccag atggaagcgc tgggctgcag cgtggagctg 1140 
ctcttcgatg gtcgcgaggc gttgctgcac tgccagacgg cctgcttcga cgtggtgctc 1200 
accgatatca acatgccgaa catgaacgga tacgagctaa ccgcggagct acggcgccaa 1260 
gggttccggc agccgatcat cggcgcgacg gcgaacgcca tgcgtgagga gcgcgagcgc 1320 
tgcatgtccg ccgggatgaa cgattgcctg gtcaaaccgg tggatctgaa tgcccttcag 1380 
aactgcttga ttaatattct caaggtggat cgatga 1416 

<210> 4 
<211> 471 
<212> PRT 

<213> Pseudomonas aeruginosa PA14 



<400> 4 



Met 


Ser 


His 


Glu 


He 


Arg 


Thr 


Pro 


Leu 


Tyr 


Gly 


Met 


Leu 


Gly 


Thr 


Leu 


1 








5 










10 










15 




Glu 


Leu 


Leu 


Gly 
20 


Arg 


Thr 


Glu 


Leu 


Ser 
25 


Arg 


Gin 


Gin 


Ala 


Gly 
30 


Tyr 


Leu 


Lys 


Ala 


He 


Gin 


His 


Ser 


Ser 


Ser 


Thr 


Leu 


Leu 


Gin 


Leu 


He 


Ser 


Asp 




35 










40 










45 








Val 


Leu 
50 


Asp 


Val 


Ser 


Lys 


He 
55 


Glu 


Ala 


Gly 


Gin 


Leu 
60 


Asp 


Leu 


Glu 


Cys 


Val 


Glu 


Phe 


Ser 


Pro 


Leu 


Glu 


Leu 


Thr 


Glu 


Glu 


Val 


Val 


Gin 


Ser 


Phe 


65 










70 










75 










80 


Thr 


Gly 


Ala 


Ala 


Gin 


Ala 


Lys 


Gly 


Leu 


Gin 


Leu 


Tyr 


Thr 


Cys 


Leu 


Ser 








85 










90 










95 




Ala 


Glu 


Leu 


Pro 
100 


Leu 


Arg 


Met 


Arg 


Gly 
105 


Ala 


Ala 


Ala 


Ser 


He 
110 


Arg 


Gin 


lie 


Leu 


Asn 
115 


Asn 


Leu 


Leu 


Ser 


Asn 
120 


Ala 


Val 


Lys 


Phe 


Thr 
125 


Asp 


Asn 


Gly 


Tyr 


Val 


Asn 


Val 


His 


Leu 


Lys 


Ala 


Ser 


Val 


Val 


Asp 


Ala 


Glu 


Cys 


Val 


130 










135 










14 0 










Met 


Leu 


Thr 


Trp 


Gin 


Val 


Asn 


Asp 


Thr 


Gly 


Met 


Gly 


lie 


Asn 


Val 


Glu 


145 










150 










155 










160 


Asp 


Gin 


Pro 


Arg 


Leu 


Phe 


Glu 


Pro 


Phe 


Tyr 


Gin 


He 


Arg 


Arg 


Ser 


Glu 








165 










170 










175 




His 


Pro 


Val 


Ala 
180 


Gly 


Thr 


Gly 


Leu 


Gly 
185 


Leu 


Ser 


He 


Ser 


Gin 
190 


Arg 


Leu 


Ala 


Gin 


Leu 
195 


Met 


Asn 


Gly 


Ser 


Leu 
200 


Lys 


Leu 


Val 


Ser 


Glu 
205 


Leu 


Gly 


Leu 


Gly 


Ser 


Ser 


Phe 


Ser 


Leu 


Arg 


Leu 


Pro 


Leu 


Glu 


Arg 


He 


Ala 


Met 


Gin 


210 










215 










220 










Ala 


Glu 


Pro 


Gin 


Asp 


Leu 


Ala 


Gly 


Cys 


Ala 


Val 


Gin 


Val 


Leu 


Ala 


Pro 


225 










230 










235 










240 


Val 


Arg 


Asp 


Leu 


Thr 


Glu 


Cys 


Leu 


Cys 


Gly 


Trp 


He 


Ser 


Arg 


Trp 


Gly 
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z 4 b 








OCA 










o a 




Giy 


Arg 


Ala ruec 


Va T Ala 
vai Ala. 


1 Hi 


Pro 


Arg 




T.on 
XtcU. 


7A qr\ 


oxu 


AT a 

nld 


nop 


Ala 






260 








265 










OTA 

J /o 






Thr 


Ser 


Leu Leu 


TT- 1 pi,, 

vai giu 


vai 


Leu 


Leu 


Leu 


tjlU 


Hi ir 

vj±y 


AT a 
Ala 


Pro 


i v ieu 








275 






o o a 










ope 
z bb 








Glu 


Ala 


Trp Pro 


Gly Cys 


Arg 


17—, T 

vai 


G1U 


Leu 


Ser 


Pro 


ijin 


uiy 


Asp 


Mo h 




290 






295 










o a a 
300 










Glu 


Pro 


Gin Ala 


Gin Gly 


Arg 


ASp 


Trp 


Leu 


Leu 


Giy 


Leu 


Asn 


Asn 


T 

Leu 


305 






310 










TIC 

315 












Asp 


G±y 


Leu His 


Arg Ala 


Leu 


Gly 


Leu 


Aia 


tt -J e-i 

nlS 


m ir 

bly 


Arg 


Leu 


ai a 
Ala 


Asp 








325 








O O A 

330 










"5 "a c: 
33 3 


Pro 


Ser 


Thr Pro 


Pro lie 


Arg 


T /"Ml 

Leu 


Ala 


Pro 


Leu 


Arg 


Asn 


Leu 


\jiy 


Leu 






340 








345 










3b0 






Arg 


vai 


Leu vai 


tT-il Pill 

vai giu 


Asp 


Asn 


Ala 


lie 


Asn 


uin 


Leu 


11c 


Leu 


Arg 




355 






360 










ice 
job 








Asp 


Gin 


Met Glu 


Ala Leu 


pi mm 

Giy 


Cys 


Ser 


Vai 


Glu 


Leu 


Leu 




Asp 


vjiy 


370 






375 










380 










Arg 


Glu 


Ala Leu 


Leu His 


Cys 


Gin 


Thr 


Ala 


Cys 


pne 


Asp 


vai 


vai 


Leu 


385 






390 










395 










AAA 
4U0 


1 XIX 


Asp 


Tl o TV en 
IXC AOil 








Asn Gly Tyr 


Glu 


Leu 


Thr 


Ala 


Glu 






405 








410 










415 




Leu 


Arg 


Arg Gin 


Gly Phe 


Arg 


Gin 


Pro 


lie 


lie 


Gly 


Ala 


Thr 


Ala 


Asn 






420 








425 










430 






Ala 


Met 


Arg Glu 


Glu Arg 


Glu 


Arg 


Cys 


Met 


Ser 


Ala 


Gly 


Met 


Asn 


Asp 






435 






440 










445 








Cys 


Leu 


Vai Lys 


Pro Vai 


Asp 


Leu 


Asn 


Ala 


Leu 


Gin 


Asn 


Cys 


Leu 


He 


450 






455 










460 










Asn 


He 


Leu Lys 


Vai Asp 


Arg 





















465 470 



<210> 5 
<211> 1995 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 5 

atgatggatg ttatacggga 
aagaccaccc agaagtacga 
gaaaaeggat tageggtcta 
gctaccaagc aegegttgag 
ctcgccaatt tetaeggaag 
atctttgatc tttccggcag 
cgtgacaggt tgagcggaag 
acccggccgg tgggggagga 
gactcggcgc tggagatgtt 
cacgacgagc cgaaccatct 
gacttcgaac agttggttga 
ggegaggtat tgctcggcgc 
cgacaggggg tcgccgttca 
cgaaccgact aeggcaattt 
accccggcgc tgctcctggc 
aacccggtgc ategggegea 
atacagaccg cgccggtggc 
tgcaaccact tggccgccca 
aactggaagc ttttcgatgc 
gggegctatt tgcagaccgc 
tgegtattea acgacatcac 



gcatgaggta tttcttgggc 
etatgaegtg gtgcctttgc 
tgagggaegg gagttttcct 
cgccgattcc tegggagate 
cttctggagt gtttccgcct 
cacccgcctg gcagtgccgt 
ctatccgatg atagtcgagc 
cgctcagcgt gtccattgga 
gggagtcgee egggttgate 
gatcatcget gcgagcctgc 
gcgcccggca ttcgattcgt 
ggcccctgcg accggcctga 
actgcgcagc cagectgaga 
ctttcgccac tcccggtggc 
cggttggctc gggatgcgtt 
ccggcaactg gtggagagcg 
tctggtggtg ctgacccagg 
gtggctgggc gggcccacgg 
gcgtgggcag gtaccaggag 
cttcgcggcg acccgctatg 
ggtccactgc gaggeggaga 



gcatcgctcg aaaaagegae 60 
ageggcaett gttggcaaag 120 
ttgetatgee atttctactg 180 
cgttttcgct cggtgtattg 240 
atcccgcgcc acagttactg 300 
cgattccctc cacagcgcag 360 
gcattctggc gcgcttgcgc 420 
tacgegctga tcgctatcgc 480 
tgeeggaaac actctggtgg 540 
ttgatctcag gcgaatcaat 600 
acagcctggt ategceggat 660 
gggatggcct gaacctcacc 720 
aeggctgget cgcggtctac 780 
tggtggcagg tetgetgetg 840 
ggtacaccag cagcgtcgtc 900 
acaccttcag ccggacgctg 960 
atgaccagca actggtgacc 1020 
agatccttgg gctgacttcc 1080 
acatctgtat ecaggteggt 1140 
ccggcaccga ggcggtactg 1200 
ccgcgctgtc caatgegaag 12 60 
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cgagcagcgg atgccgccag ccaggccaag accctgttcc tggcccgcat gagccatgaa 1320 
atccgtactc ccctgtacgg tgtccttggc accctggagt tgctcgacct gaccaccctg 1380 
aacgagcggc aacgcgccta cctacgcacc atccagagtt cgtctgcgac gctcatgcaa 1440 
ctgattagcg atgtgctgga tgtctcgaag atcgaagcgg ggcagatggc tctgaccctg 1500 
gccgccttca atccgctgga cctagtgcgg gaagtgcttg gcaactttgc cgccagcgcc 1560 
atggccaagg acctgcaggt agacccgctc gatactcttg cgcttgaggc gcaggtcgcg 1620 
catggcttcg aagaaagcgt tctgttcgag gttgctggtg gctcggtcgg ccatttcgaa 1680 
gagggtgtcg tcggcgttgt cgaacaacgc ctgcaacgcc tgtttcagct gcagcgccgc 1740 
cttgtcgcgc acctgcacga ggatgaccgg caggcgcccc gctccggcgt tcggcgacgg 1800 
ctcggaagcg accctggtca ggtgcaccac attggcatcg ttctgcatcg ggactctcct 1860 
gccaccctcg cggccgcgca tggaatggca aaaatcgggc acagaggatc gattggcgtc 1920 
gtccgtaacg tcaatttcca ggcgtcaaaa acaagtatct acattcatta tagagatact 1980 
ttcaaatcta gatag 1995 

<210> 6 
<211> 664 
<212> PRT 

<213> Pseudomonas aeruginosa PA14 

<400> 6 ^rs 

Met Met Asp Val lie Arg Glu His Glu Val Phe Leu Gly\Arg lie Ala 

1 5 10 J 15 

Arg Lys Ser Asp Lys Thr Thr Gin Lys Tyr Asp Tyr Asp Val Val Pro 

20 25 „ . 30 

Leu Gin Arg His Leu Leu Ala Lys Glu Asn^Gly \eu Ala Val Tyr Glu 

35 40 ^ 45 

Gly Arg Glu Phe Ser Phe Ala Met Pro Phe Leu Leu Ala Thr Lys His 

50 55 ^— v 60 m 

Ala Leu Ser Ala Asp Ser Serfcly Asp Pro Phe Ser Leu (clyNVal Leu 

65 70 75 V — ' 80 

Leu Ala Asn Phe TyrC^ly^Ser Phe Trp Ser Val Ser 'Ala Tyr Pro Ala 

85 ^ 90 95 

Pro Gin Leu Leu lie Phe Asp Leu Ser tGly^Ser Thr Arg Leu Ala Val 

100 105 ' 110 

Pro Ser He Pro Ser Thr Ala Gin Arg Asp Arg Leu Ser'QSly^Ser Tyr 

115 120 125 ' 

Pro Met He Val Glu Arg He Leu Ala Arg Leu Arg Thr Arg Pro Val 

130 135 140 

Gly Glu Asp Ala Gin Arg Val His Trp He Arg Ala Asp Arg Tyr Arg 
145 150 155 160 

Asp Ser Ala Leu Glu Met LeuU3ly\Val Ala Arg Val Asp Leu Pro Glu 

165 170 175 

Thr Leu Trp Trp His Asp Glu Pro Asn His Leu He He Ala Ala Ser 

180 185 190 

Leu Leu Asp Leu Arg Arg He Asn Asp Phe Glu Gin Leu Val Glu Arg 

195 200 2Q5 

Pro Ala Phe Asp Ser Tyr Ser Leu Val Ser Pro Asp^GlyNGlu Val Leu 

210 215 220 V — ' 

Leu (qiyNAla Ala Pro Ala Thr xSy^eu Arg Asp^Gl^Leu Asn Leu Thr 

225 ^^/—v 230 S 235 Nw-/ 240 

Arg Gin <Gly\val Ala Val Gin Leu Arg Ser Gin Pro Glu Asn (plyTrp 

^--^ 245 ^-^250 255^ 

Leu Ala Val Tyr Arg Thr Asp Tyr ^ly^Asn Phe Phe Arg His Ser Arg 

260^^^ 270 ^ ^ 

Trp Leu Val Ala c <31y\Leu Leu Leu Thr Pro Ala Leu Leu Leu Ala^GlyX 

Z25 280 285 

Trp Leu 6ly)Met Arg Trp Tyr Thr Ser Ser Val Val Asn Pro Val His 
290 295 300 
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Arg Ala 
305 

lie Gin 
Gin Leu 
Thr Glu 



31y)Gln 
370 
Gin Thr 
385 

Cys Val 
Ser Asn 
Phe Leu 
Leu ply 



Arg Ala 
465 

Leu He 

Ala Leu 

Leu^y^ 

Pro Leu 
530 
Glu Ser 

Glu ^Sly) 

Leu Gin 

Pro Arg 

His His 
610 
Ala Ala 
625 

Val Arg 



His Arg Gin Leu 
310 

Thr Ala Pro Val 
325 

Val Thr Cys Asn 

He Leu(Gly jLeu 
355 

Val Pro^Gl^Asp 

Ala Phe Ala Ala 
390 

Phe Asn Asp He 
405 

Ala Lys Arg Ala 
420 

Ala Arg Met Ser 
435 

'hr Leu Glu Leu 

Tyr Leu Arg Thr 
470 

Ser Asp Val Leu 
485 

Thr Leu Ala Ala 
500 

)Asn Phe Ala Ala 
515 

Asp Thr Leu Ala 

Val Leu Phe Glu 

Val Val (^ly^Val 

Arg Arg Leu Val 

Ser^^lyNval Arg 
595 

He Gly He Val 



His^y^et Ala 

Asn Val Asn Phe 
645 

Tyr Arg Asp Thr Phe Lys 
660 



Val Glu 

Ala Leu 

His Leu 

Thr Ser 
360 
He Cys 
375 

Thr Arg 

Thr Val 

Ala Asp 

His Glu 
440 
Leu Asp 
455 

He Gin 

Asp Val 

Phe Asn 

Ser Ala 
520 
Leu Glu 
535 

Val Ala 

Val Glu 

Ala His 

Arg Arg 
600 
Leu His 
615 

Lys He 
Gin Ala 
Ser Arg 



Ser Asp Thr 
315 

Val Val Leu 

330 
Ala Ala Gin 
345 

Asn Trp Lys 

He Gin Val( 

Tyr Ala (Gly 

His Cys Glu 

410 
Ala Ala Ser 
425 

He Arg Thr 

Leu Thr Thr 

Ser Ser Ser 
475 

Ser Lys He 

490 
Pro Leu Asp 
505 

Met Ala Lys 
Ala Gin Val 
Ser 

Gin Arg Leu 

570 
Leu His Glu 

LeuK?ly) Ser 

Arg Asp Ser 

(6l^)His Arg 
y 635 
Ser Lys Thr 
650 



Phe Ser 

Thr Gin 

Trp Leu 

Leu Phe 
365 

& 

)Thr Glu 

Ala Glu 

Gin Ala 

Pro Leu 
445 
Leu Asn 
460 

Ala Thr 

Glu Ala 

Leu Val 

Asp Leu 
525 
Ala His 
540 

Val Gly 

Gin Arg 

Asp Asp 

Asp Pro 
605 
Pro Ala 
620 
'^ly^Ser 

Ser He 



Arg Thr Leu 
320 

Asp Asp Gin 

Gly (5iy\ro 

350 X 

Asp Ala Arg 

Arg Tyr Leu 

Ala Val Leu 
400 

Thr Ala Leu 

415 
Lys Thr Leu 

Tyr^piy)Val 

Glu Arg Gin 

Leu Met Gin 

e480 
Gin Met 
495 
Arg Glu Val 
510 

Gin Val Asp 

<|E^)Phe. Glu 

His Phe Glu 
560 

Leu Phe Gin 

575 
Arg Gin Ala 
590 

Gly Gin Val 

Thr Leu Ala 

He(c^v Val 
640 

Tyr He His 
655 



<210> 7 
<211> 6410 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 7 

cttcctcacc gcactttcat tcaccgcacc gttatcggcg tcatggacaa tgccctgcag 60 
tcctgagacc tcattaggaa tcatctgata gagggaattt ttccatgccg tcagtctttc 120 
ggcatgaagt tgaagaattt cttacagcct tttgatagcg gtttctccac tccgagtgct 180 
gcgctcaagc tgctccgcat gctcggtggc gccttgatgt tgtgcgtgct atgcagcctg 240 



-6- 



WO 03/004689 



PCT7US02/21431 



atattcagtg tgagcatggt tttaaaccat 

gtggctatgt acgaagcgca gctttatttc 

agcggcaatg tcgtgccctt ggccgcgggt 

gtgagcatcc tgccgttgag tgacggaggg 

ggtgatctcc gggaaaagcg gctggcactg 

ctggtttacc ggcttaccgc cgatggtagg 

aaagaggtgt accgagcctt gctggcgact 

ggtggtaccc ctcaacggct gtaccttttt 

tggctaggcc tggagattct cggcgaagac 

ggaaactaca tgctgctgga tcagcatggg 

ctggggagcg gtgcgtcgcg gacgcttttg 

ggcccactgc cgcagcatat ggtgcttttc 

atctatcaca tcggtatcgg tcgcctgttg 

tctgcgttgg cactcgcagt cggcatccta 

cgcttgatag agcccgcaaa gcgacgcctt 

cgtgcagtta tccaggccgc gcccgtcgcg 

gtggtcctgg aaaatcccca ggcgcgccaa 

gacgcgccga gatggatttc ccaggcgttc 

ctggaaaccg aggcagggct acatcttcat 

gaagacgtat tgttctgcgc cttcagtgaa 

ctggctcgcg caaaatccct ggcggatgct 

accatgagcc atgaaatccg cacacctctg 

gggcgtaccg agctgagtcg gcagcaggcc 

tcgaccctgc tgcaactgat cagcgatgtg 

ctggacctag agtgcgtgga attctccccg 

ttcaccggtg ccgcgcaggc caaggggctg 

ccgctgcgca tgcggggggc cgcggcgtcg 

aacgcggtga agttcaccga caatggctat 

gatgccgaat gtgtgatgct gacctggcag 

gaggatcagc cgcgtctgtt cgaaccgttc 

gcaggcacgg gcctcggctt gtcgatcagc 

ctgaaactgg tcagtgagct ggggttgggc 

cggatcgcga tgcaggctga gccgcaggac 

cctgtccgcg acctaacgga atgcctgtgt 

atggtcgcga cgccgaggtc gctggacgag 

ttactgctgg agggggcgcc gatgttcgaa 

cctcagggtg atatggagcc gcaggcacag 

ctggacggcc tgcatcgtgc tctgggcctg 

ccgccgatac ggctggctcc gttgcgcaat 

aacgcgatca accagttgat cttgagggac 

ctgctcttcg atggtcgcga ggcgttgctg 

ctcaccgata tcaacatgcc gaacatgaac 

caagggttcc ggcagccgat catcggcgcg 

cgctgcatgt ccgccgggat gaacgattgc 

cagaactgct tgattaatat tctcaaggtg 

tggtggtcga agatcaigccg tttcagcgcg 

gcgtgcagta cctggtaggt gccggcgacg 

acaggttcga cctgatcctc agcgatctga 

tcctgcaact gccgtatctc aagcatcgtc 

agcggatgat gctcagtgcc agccgggtcg 

tgttgcccaa gccgactctg cccaaggcca 

gcctcaggca gaagctggag ccggaaaccg 

tggatgccct gcataacgag caactggtga 

ccgggcgcat agtcggcgcc gaggcgttga 

tgctgcccag ctgtttcatg agtgatgtcg 

ggcgcgtgct cgaacagacc ctgaacgccc 

ttccggtttc ggtgaatctg ccgccgcacc 

tctatgagta cgtcggcgct cgcggggctt 



caggtgtccc tcagtcggca agctatgaat 300 
gagcagcgcg aggcgttgct caatcacttg 360 
agagcgctcg tcaacgaagc gccgaacaat 42 0 
cgaggtctgc tattgaccgc tcgcacgctc 480 
atgtatctgg tcgataccga caaaggccct 540 
ccctcggcag cgatatccag cacgataacc 600 
ccgtcggcgc ctgttcactg ggtgactgac 660 
gaatccttag gcgatgagcc gggcgagggg 720 
ctcgattcga tgttgcgccg gaatgatgcc 780 
caggtcgtac tcgctacgga cgcagaggcg 840 
cgtggagacg gcttcggttt catcggtgct 900 
cagcacgtgg ggtcttcgag ctgggatctg 960 
ctggctctgt ggctccctct gttacttgcc 1020 
ctgcattggc tggtgcggag catcgagcga 1080 
gaagcattga aggagagcga agccttttcc 1140 
ctgtgcgtgc tgcgtcgtgc cgacgccgca 12 00 
tggctgggtg atagcgaggc gattgcccac 1260 
gcaggaggtg tgaagtgttc tggagaagaa 1320 
ctcaattaca cgcccacccg ctataacggt 13 80 
atcagtgcac gcaagcggat ggaggcggaa 1440 
gccaatgaag ccaagacgct gtttctcgcc 1500 
tacggcatgc ttggcacgct tgagctgctt 1560 
ggttacctaa aggcaatcca gcattcctcg 1620 
cttgacgtat ccaagataga ggccggccaa 1680 
ctggaattga ccgaagaggt cgtgcagtcg 1740 
cagttgtata cctgcctctc tgcggagctg 1800 
atccggcaga ttctcaacaa cctgctgagc 1860 
gtcaacgtcc acctgaaggc cagcgtggtc 1920 
gtcaacgata ccggcatggg gatcaacgtc 1980 
taccagatac gccgctccga gcatccggtc 2040 
cagcgcctgg cgcagctaat gaatggcagt 2100 
agcagcttta gcctcaggct tccgcttgag 2160 
ctagccgggt gcgccgtcca agtgctggcg 222 0 
ggctggatct cccgctgggg tggaagggcc 2280 
gcggacgcga cctcgctgct ggtcgaagtg 2340 
gcatggccag gatgccgggt ggagctttcc 2400 
ggccgcgact ggctgctcgg gctcaacaac 2460 
gcccatgggc gtctcgctga tccttcgacg 2520 
ctaggtctcc gcgtcctagt ggtggaggat 2580 
cagatggaag cgctgggctg cagcgtggag 2640 
cactgccaga cggcctgctt cgacgtggtg 2700 
ggatacgagc taaccgcgga gctacggcgc 2760 
acggcgaacg ccatgcgtga ggagcgcgag 2820 
ctggtcaaac cggtggatct gaatgccctt 2880 
gatcgatgag ctggaaatcc tatcgggtgc 2940 
aatacctgct caacctgttt cgcgagcgcg 3000 
gcgcggaggc gttgcgctgc ctgaagcagg 3 060 
tgatgccggg catggatggt atccaaatga 312 0 
cgaagctggc gctgatgagc tcctcgtcgc 3180 
cccagagtct cggcttgtcg gtaatcgacc 3240 
tcggccaact tctggaacac ctggaaagat 33 00 
acgagactcc gcatgggcgc acggcgttgc 33 60 
cctggttcca ggctaagaaa tccctccaca 342 0 
tacgctggag ccacccgcag catggcctgt 3480 
acgctaccgg tctgcacgag gcgttgctct 3540 
aggaatcgtg gcgcagggcg ggttacgaga 3600 
tgctcgataa ccaggaactt ccggatcgac 3660 
gtaccagctc actatgtttc gagttgaccg 3720 
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agagcagtgt cacaactctg tcaagtaact actatgcagg tgcctgtcgc ttgcgcatga 3780 
aagggttcgg attggcccag gacgactttg gccagggtta cagctcgttc tataacctgg 3840 
tcacgacgcc tttcacggag ctgaagatcg accgctccct agtccaggga tgcgtagagg 3900 
ataacggcct caatgcagct gtcatcagtt gtattgagtt gggtcaccgc ctgaatctcg 3960 
acgtggtggc cgaaggcgtg gagacctgcg aggaactgaa tcttcttcgt cgtcttggct 4020 
gcgaccgggc gcagggtttc ctgatttcta aggcagtgtc tgctcgtgag ttcgagcggc 4080 
agttaaggga ggacggcccc agcctccttg tttaacgcag tatccccatt atcgcggagt 4140 
cgatcgcagc caaccaccgt cagcgcaaca gtgtcgctga aggaggccgc atcccgtgaa 4200 
gtctgctagt gccttggagc acgacaacaa acttttgctc aaatggacaa ccctctcgca 4260 
gagcctgagc atcggcttga tctgtgtggt ggtgctgacc gtattgctgt tcagcatctg 4320 
ttactggtcg ctggggagat tgtttcagga ggaggaggac aaagtctcct tccacttcac 4380 
ccgtatgatg gatgttatac gggagcatga ggtatttctt gggcgcatcg ctcgaaaaag 444 0 
cgacaagacc acccagaagt acgactatga cgtggtgcct ttgcagcggc acttgttggc 4500 
aaaggaaaac ggattagcgg tctatgaggg acgggagttt tcctttgcta tgccatttct 4560 
actggctacc aagcacgcgt tgagcgccga ttcctcggga gatccgtttt cgctcggtgt 4620 
attgctcgcc aatttctacg gaagcttctg gagtgtttcc gcctatcccg cgccacagtt 4680 
actgatcttt gatctttccg gcagcacccg cctggcagtg ccgtcgattc cctccacagc 474 0 
gcagcgtgac aggttgagcg gaagctatcc gatgatagtc gagcgcattc tggcgcgctt 4800 
gcgcacccgg ccggtggggg aggacgctca gcgtgtccat tggatacgcg ctgatcgcta 4860 
tcgcgactcg gcgctggaga tgttgggagt cgcccgggtt gatctgccgg aaacactctg 4920 
gtggcacgac gagccgaacc atctgatcat cgctgcgagc ctgcttgatc tcaggcgaat 4980 
caatgacttc gaacagttgg ttgagcgccc ggcattcgat tcgtacagcc tggtatcgcc 504 0 
ggatggcgag gtattgctcg gcgcggcccc tgcgaccggc ctgagggatg gcctgaacct 5100 
cacccgacag ggggtcgccg ttcaactgcg cagccagcct gagaacggct ggctcgcggt 5160 
ctaccgaacc gactacggca atttctttcg ccactcccgg tggctggtgg caggtctgct 5220 
gctgaccccg gcgctgctcc tggccggttg gctcgggatg cgttggtaca ccagcagcgt 5280 
cgtcaacccg gtgcatcggg cgcaccggca actggtggag agcgacacct tcagccggac 534 0 
gctgatacag accgcgccgg tggctctggt ggtgctgacc caggatgacc agcaactggt 5400 
gacctgcaac cacttggccg cccagtggct gggcgggccc acggagatcc ttgggctgac 5460 
ttccaactgg aagcttttcg atgcgcgtgg gcaggtacca ggagacatct gtatccaggt 5520 
cggtgggcgc tatttgcaga ccgccttcgc ggcgacccgc tatgccggca ccgaggcggt 5580 
actgtgcgta ttcaacgaca tcacggtcca ctgcgaggcg gagaccgcgc tgtccaatgc 5640 
gaagcgagca gcggatgccg ccagccaggc caagaccctg ttcctggccc gcatgagcca 5700 
tgaaatccgt actcccctgt acggtgtcct tggcaccctg gagttgctcg acctgaccac 5760 
cctgaacgag cggcaacgcg cctacctacg caccatccag agttcgtctg cgacgctcat 5820 
gcaactgatt agcgatgtgc tggatgtctc gaagatcgaa gcggggcaga tggctctgac 5880 
cctggccgcc ttcaatccgc tggacctagt gcgggaagtg cttggcaact ttgccgccag 5940 
cgccatggcc .aaggacctgc aggtagaccc gctcgatact cttgcgcttg aggcgcaggt 6000 
cgcgcatggc ttcgaagaaa gcgttctgtt cgaggttgct ggtggctcgg tcggccattt 6060 
cgaagagggt gtcgtcggcg ttgtcgaaca acgcctgcaa cgcctgtttc agctgcagcg 6120 
ccgccttgtc gcgcacctgc acgaggatga ccggcaggcg ccccgctccg gcgttcggcg 6180 
acggctcgga agcgaccctg gtcaggtgca ccacattggc atcgttctgc atcgggactc 6240 
tcctgccacc ctcgcggccg cgcatggaat ggcaaaaatc gggcacagag gatcgattgg 63 00 
cgtcgtccgt aacgtcaatt tccaggcgtc aaaaacaagt atctacattc attatagaga 6360 
tactttcaaa tctagataga gggttttcct atagacatga ctctcttcac 6410 



<210> 8 
<211> 2796 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 8 

atgaagttga agaatttctt acagcctttt gatagcggtt tctccactcc gagtgctgcg 60 
ctcaagctgc tccgcatgct cggtggcgcc ttgatgttgt gcgtgctatg cagcctgata 12 0 
ttcagtgtga gcatggtttt aaaccatcag gtgtccctca gtcggcaagc tatgaatgtg 180 
gctatgtacg aagcgcagct ttatttcgag cagcgcgagg cgttgctcaa tcacttgagc 240 
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ggcaatgtcg tgcccttggc cgcgggtaga gcgctcgtca acgaagcgcc gaacaatgtg 300 
agcatcctgc cgttgagtga cggagggcga ggtctgctat tgaccgctcg cacgctcggt 360 
gatctccggg aaaagcggct ggcactgatg tatctggtcg ataccgacaa aggccctctg 420 
gtttaccggc ttaccgccga tggtaggccc tcggcagcga tatccagcac gataaccaaa 480 
gaggtgtacc gagccttgct ggcgactccg tcggcgcctg ttcactgggt gactgacggt 540 
ggtacccctc aacggctgta cctttttgaa tccttaggcg atgagccggg cgaggggtgg 600 
ctaggcctgg agattctcgg cgaagacctc gattcgatgt tgcgccggaa tgatgccgga 660 
aactacatgc tgctggatca gcatgggcag gtcgtactcg ctacggacgc agaggcgctg 72 0 
gggagcggtg cgtcgcggac gcttttgcgt ggagacggct tcggtttcat cggtgctggc 780 
ccactgccgc agcatatggt gcttttccag cacgtggggt cttcgagctg ggatctgatc 840 
tatcacatcg gtatcggtcg cctgttgctg gctctgtggc tccctctgtt acttgcctct 900 
gcgttggcac tcgcagtcgg catcctactg cattggctgg tgcggagcat cgagcgacgc 960 
ttgatagagc ccgcaaagcg acgccttgaa gcattgaagg agagcgaagc cttttcccgt 1020 
gcagttatcc aggccgcgcc cgtcgcgctg tgcgtgctgc gtcgtgccga cgccgcagtg 1080 
gtcctggaaa atccccaggc gcgccaatgg ctgggtgata gcgaggcgat tgcccacgac 1140 
gcgccgagat ggatttccca ggcgttcgca ggaggtgtga agtgttctgg agaagaactg 1200 
gaaaccgagg cagggctaca tcttcatctc aattacacgc ccacccgcta taacggtgaa 1260 
gacgtattgt tctgcgcctt cagtgaaatc agtgcacgca agcggatgga ggcggaactg 1320 
gctcgcgcaa aatccctggc ggatgctgcc aatgaagcca agacgctgtt tctcgccacc 13 80 
atgagccatg aaatccgcac acctctgtac ggcatgcttg gcacgcttga gctgcttggg 1440 
cgtaccgagc tgagtcggca gcaggccggt tacctaaagg caatccagca ttcctcgtcg 1500 
accctgctgc aactgatcag cgatgtgctt gacgtatcca agatagaggc cggccaactg 1560 
gacctagagt gcgtggaatt ctccccgctg gaattgaccg aagaggtcgt gcagtcgttc 1620 
accggtgccg cgcaggccaa ggggctgcag ttgtatacct gcctctctgc ggagctgccg 1680 
ctgcgcatgc ggggggccgc ggcgtcgatc cggcagattc tcaacaacct gctgagcaac 1740 
gcggtgaagt tcaccgacaa tggctatgtc aacgtccacc tgaaggccag cgtggtcgat 1800 
gccgaatgtg tgatgctgac ctggcaggtc aacgataccg gcatggggat caacgtcgag 1860 
gatcagccgc gtctgttcga accgttctac cagatacgcc gctccgagca tccggtcgca 1920 
ggcacgggcc tcggcttgtc gatcagccag cgcctggcgc agctaatgaa tggcagtctg 1980 
aaactggtca gtgagctggg gttgggcagc agctttagcc tcaggcttcc gcttgagcgg 2040 
atcgcgatgc aggctgagcc gcaggaccta gccgggtgcg ccgtccaagt gctggcgcct 2100 
gtccgcgacc taacggaatg cctgtgtggc tggatctccc gctggggtgg aagggccatg 2160 
gtcgcgacgc cgaggtcgct ggacgaggcg gacgcgacct cgctgctggt cgaagtgtta 2220 
ctgctggagg gggcgccgat gttcgaagca tggccaggat gccgggtgga gctttcccct 2280 
cagggtgata tggagccgca ggcacagggc cgcgactggc tgctcgggct caacaacctg 2340 
gacggcctgc atcgtgctct gggcctggcc catgggcgtc tcgctgatcc ttcgacgccg 24 00 
ccgatacggc tggctccgtt gcgcaatcta ggtctccgcg tcctagtggt ggaggataac 2460 
gcgatcaacc agttgatctt gagggaccag atggaagcgc tgggctgcag cgtggagctg 2520 
ctcttcgatg gtcgcgaggc gttgctgcac tgccagacgg cctgcttcga cgtggtgctc 2580 
accgatatca acatgccgaa catgaacgga tacgagctaa ccgcggagct acggcgccaa 2640 
gggttccggc agccgatcat cggcgcgacg gcgaacgcca tgcgtgagga gcgcgagcgc 2700 
tgcatgtccg ccgggatgaa cgattgcctg gtcaaaccgg tggatctgaa tgcccttcag 2760 
aactgcttga ttaatattct caaggtggat cgatga 2796 

<210> 9 
<211> 2721 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 9 

atgctcggtg gcgccttgat gttgtgcgtg ctatgcagcc tgatattcag 
gttttaaacc atcaggtgtc cctcagtcgg caagctatga atgtggctat 
cagctttatt tcgagcagcg cgaggcgttg ctcaatcact tgagcggcaa 
ttggccgcgg gtagagcgct cgtcaacgaa gcgccgaaca atgtgagcat 
agtgacggag ggcgaggtct gctattgacc gctcgcacgc tcggtgatct 
cggctggcac tgatgtatct ggtcgatacc gacaaaggcc ctctggttta 
gccgatggta ggccctcggc agcgatatcc agcacgataa ccaaagaggt 
ttgctggcga ctccgtcggc gcctgttcac tgggtgactg acggtggtac 



tgtgagcatg 60 
gtacgaagcg 120 
tgtcgtgccc 180 
cctgccgttg 240 
ccgggaaaag 300 
ccggcttacc 360 
gtaccgagcc 420 
ccctcaacgg 480 
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ctgtaccttt ttgaatcctt aggcgatgag ccgggcgagg ggtggctagg cctggagatt 540 
ctcggcgaag acctcgattc gatgttgcgc cggaatgatg ccggaaacta catgctgctg 600 
gatcagcatg ggcaggtcgt actcgctacg gacgcagagg cgctggggag cggtgcgtcg 660 
cggacgcttt tgcgtggaga cggcttcggt ttcatcggtg ctggcccact gccgcagcat 720 
atggtgcttt tccagcacgt ggggtcttcg agctgggatc tgatctatca catcggtatc 780 
ggtcgcctgt tgctggctct gtggctccct ctgttacttg cctctgcgtt ggcactcgca 840 
gtcggcatcc tactgcattg gctggtgcgg agcatcgagc gacgcttgat agagcccgca 900 
aagcgacgcc ttgaagcatt gaaggagagc gaagcctttt cccgtgcagt tatccaggcc 960 
gcgcccgtcg cgctgtgcgt gctgcgtcgt gccgacgccg cagtggtcct ggaaaatccc 1020 
caggcgcgcc aatggctggg tgatagcgag gcgattgccc acgacgcgcc gagatggatt 1080 
tcccaggcgt tcgcaggagg tgtgaagtgt tctggagaag aactggaaac cgaggcaggg 1140 
ctacatcttc atctcaatta cacgcccacc cgctataacg gtgaagacgt attgttctgc 1200 
gccttcagtg aaatcagtgc acgcaagcgg atggaggcgg aactggctcg cgcaaaatcc 1260 
ctggcggatg ctgccaatga agccaagacg ctgtttctcg ccaccatgag ccatgaaatc 1320 
cgcacacctc tgtacggcat gcttggcacg cttgagctgc ttgggcgtac cgagctgagt 1380 
cggcagcagg ccggttacct aaaggcaatc cagcattcct cgtcgaccct gctgcaactg 1440 
atcagcgatg tgcttgacgt atccaagata gaggccggcc aactggacct agagtgcgtg 1500 
gaattctccc cgctggaatt gaccgaagag gtcgtgcagt cgttcaccgg tgccgcgcag 1560 
gccaaggggc tgcagttgta tacctgcctc tctgcggagc tgccgctgcg catgcggggg 1620 
gccgcggcgt cgatccggca gattctcaac aacctgctga gcaacgcggt gaagttcacc 1680 
gacaatggct atgtcaacgt ccacctgaag gccagcgtgg tcgatgccga atgtgtgatg 1740 
ctgacctggc aggtcaacga taccggcatg gggatcaacg tcgaggatca gccgcgtctg 1800 
ttcgaaccgt tctaccagat acgccgctcc gagcatccgg tcgcaggcac gggcctcggc 1860 
ttgtcgatca gccagcgcct ggcgcagcta atgaatggca gtctgaaact ggtcagtgag 1920 
ctggggttgg gcagcagctt tagcctcagg cttccgcttg agcggatcgc gatgcaggct 1980 
gagccgcagg acctagccgg gtgcgccgtc caagtgctgg cgcctgtccg cgacctaacg 2040 
gaatgcctgt gtggctggat ctcccgctgg ggtggaaggg ccatggtcgc gacgccgagg 2100 
tcgctggacg aggcggacgc gacctcgctg ctggtcgaag tgttactgct ggagggggcg 2160 
ccgatgttcg aagcatggcc aggatgccgg gtggagcttt cccctcaggg tgatatggag 2220 
ccgcaggcac agggccgcga ctggctgctc gggctcaaca acctggacgg cctgcatcgt 22 80 
gctctgggcc tggcccatgg gcgtctcgct gatccttcga cgccgccgat acggctggct 2340 
ccgttgcgca atctaggtct ccgcgtccta gtggtggagg ataacgcgat caaccagttg 2400 
atcttgaggg accagatgga agcgctgggc tgcagcgtgg agctgctctt cgatggtcgc 2460 
gaggcgttgc tgcactgcca gacggcctgc ttcgacgtgg tgctcaccga tatcaacatg 2520 
ccgaacatga acggatacga gctaaccgcg gagctacggc gccaagggtt ccggcagccg 2580 
atcatcggcg cgacggcgaa cgccatgcgt gaggagcgcg agcgctgcat gtccgccggg 2640 
atgaacgatt gcctggtcaa accggtggat ctgaatgccc ttcagaactg cttgattaat 2700 
attctcaagg tggatcgatg a 2721 

<210> 10 
<211> 2703 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 10 

atgttgtgcg tgctatgcag cctgatattc agtgtgagca tggttttaaa ccatcaggtg 60 
tccctcagtc ggcaagctat gaatgtggct atgtacgaag cgcagcttta tttcgagcag 12 0 
cgcgaggcgt tgctcaatca cttgagcggc aatgtcgtgc ccttggccgc gggtagagcg 180 
ctcgtcaacg aagcgccgaa caatgtgagc atcctgccgt tgagtgacgg agggcgaggt 240 
ctgctattga ccgctcgcac gctcggtgat ctccgggaaa agcggctggc actgatgtat 300 
ctggtcgata ccgacaaagg ccctctggtt taccggctta ccgccgatgg taggccctcg 360 
gcagcgatat ccagcacgat aaccaaagag gtgtaccgag ccttgctggc gactccgtcg 420 
gcgcctgttc actgggtgac tgacggtggt acccctcaac ggctgtacct ttttgaatcc 480 
ttaggcgatg agccgggcga ggggtggcta ggcctggaga ttctcggcga agacctcgat 540 
tcgatgttgc gccggaatga tgccggaaac tacatgctgc tggatcagca tgggcaggtc 600 
gtactcgcta cggacgcaga ggcgctgggg agcggtgcgt cgcggacgct tttgcgtgga 660 
gacggcttcg gtttcatcgg tgctggccca ctgccgcagc atatggtgct tttccagcac 720 
gtggggtctt cgagctggga tctgatctat cacatcggta tcggtcgcct gttgctggct 780 
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ctgtggctcc ctctgttact tgcctctgcg ttggcactcg cagtcggcat cctactgcat 840 
tggctggtgc ggagcatcga gcgacgcttg atagagcccg caaagcgacg ccttgaagca 900 
ttgaaggaga gcgaagcctt ttcccgtgca gttatccagg ccgcgcccgt cgcgctgtgc 960 
gtgctgcgtc gtgccgacgc cgcagtggtc ctggaaaatc cccaggcgcg ccaatggctg 1020 
ggtgatagcg aggcgattgc ccacgacgcg ccgagatgga tttcccaggc gttcgcagga 1080 
ggtgtgaagt gttctggaga agaactggaa accgaggcag ggctacatct tcatctcaat 1140 
tacacgccca cccgctataa cggtgaagac gtattgttct gcgccttcag tgaaatcagt 1200 
gcacgcaagc ggatggaggc ggaactggct cgcgcaaaat ccctggcgga tgctgccaat 1260 
gaagccaaga cgctgtttct cgccaccatg agccatgaaa tccgcacacc tctgtacggc 1320 
atgcttggca cgcttgagct gcttgggcgt accgagctga gtcggcagca ggccggttac 1380 
ctaaaggcaa tccagcattc ctcgtcgacc ctgctgcaac tgatcagcga tgtgcttgac 1440 
gtatccaaga tagaggccgg ccaactggac ctagagtgcg tggaattctc cccgctggaa 1500 
ttgaccgaag aggtcgtgca gtcgttcacc ggtgccgcgc aggccaaggg gctgcagttg 1560 
tatacctgcc tctctgcgga gctgccgctg cgcatgcggg gggccgcggc gtcgatccgg 1620 
cagattctca acaacctgct gagcaacgcg gtgaagttca ccgacaatgg ctatgtcaac 1680 
gtccacctga aggccagcgt ggtcgatgcc gaatgtgtga tgctgacctg gcaggtcaac 1740 
gataccggca tggggatcaa cgtcgaggat cagccgcgtc tgttcgaacc gttctaccag 1800 
atacgccgct ccgagcatcc ggtcgcaggc acgggcctcg gcttgtcgat cagccagcgc 1860 
ctggcgcagc taatgaatgg cagtctgaaa ctggtcagtg agctggggtt gggcagcagc 1920 
tttagcctca ggcttccgct tgagcggatc gcgatgcagg ctgagccgca ggacctagcc 1980 
gggtgcgccg tccaagtgct ggcgcctgtc cgcgacctaa cggaatgcct gtgtggctgg 2040 
atctcccgct ggggtggaag ggccatggtc gcgacgccga ggtcgctgga cgaggcggac 2100 
gcgacctcgc tgctggtcga agtgttactg ctggaggggg cgccgatgtt cgaagcatgg 2160 
ccaggatgcc gggtggagct ttcccctcag ggtgatatgg agccgcaggc acagggccgc 2220 
gactggctgc tcgggctcaa caacctggac ggcctgcatc gtgctctggg cctggcccat 2280 
gggcgtctcg ctgatccttc gacgccgccg atacggctgg ctccgttgcg caatctaggt 2340 
ctccgcgtcc tagtggtgga ggataacgcg atcaaccagt tgatcttgag ggaccagatg 2400 
gaagcgctgg gctgcagcgt ggagctgctc ttcgatggtc gcgaggcgtt gctgcactgc 2460 
cagacggcct gcttcgacgt ggtgctcacc gatatcaaca tgccgaacat gaacggatac 2520 
gagctaaccg cggagctacg gcgccaaggg ttccggcagc cgatcatcgg cgcgacggcg 2580 
aacgccatgc gtgaggagcg cgagcgctgc atgtccgccg ggatgaacga ttgcctggtc 2640 
aaaccggtgg atctgaatgc ccttcagaac tgcttgatta atattctcaa ggtggatcga 2700 
tga ~ 2703 

<210> 11 
<211> 2664 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 11 

atggttttaa accatcaggt gtccctcagt cggcaagcta tgaatgtggc tatgtacgaa 60 
gcgcagcttt atttcgagca gcgcgaggcg ttgctcaatc acttgagcgg caatgtcgtg 120 
cccttggccg cgggtagagc gctcgtcaac gaagcgccga acaatgtgag catcctgccg 180 
ttgagtgacg gagggcgagg tctgctattg accgctcgca cgctcggtga tctccgggaa 240 
aagcggctgg cactgatgta tctggtcgat accgacaaag gccctctggt ttaccggctt 300 
accgccgatg gtaggccctc ggcagcgata tccagcacga taaccaaaga ggtgtaccga 360 
gccttgctgg cgactccgtc ggcgcctgtt cactgggtga ctgacggtgg tacccctcaa 42 0 
cggctgtacc tttttgaatc cttaggcgat gagccgggcg aggggtggct aggcctggag 480 
attctcggcg aagacctcga ttcgatgttg cgccggaatg atgccggaaa ctacatgctg 540 
ctggatcagc atgggcaggt cgtactcgct acggacgcag aggcgctggg gagcggtgcg 600 
tcgcggacgc ttttgcgtgg agacggcttc ggtttcatcg gtgctggccc actgccgcag 660 
catatggtgc ttttccagca cgtggggtct tcgagctggg atctgatcta tcacatcggt 720 
atcggtcgcc tgttgctggc tctgtggctc cctctgttac ttgcctctgc gttggcactc 780 
gcagtcggca tcctactgca ttggctggtg cggagcatcg agcgacgctt gatagagccc 840 
gcaaagcgac gccttgaagc attgaaggag agcgaagcct tttcccgtgc agttatccag 900 
gccgcgcccg tcgcgctgtg cgtgctgcgt cgtgccgacg ccgcagtggt cctggaaaat 960 
ccccaggcgc gccaatggct gggtgatagc gaggcgattg cccacgacgc gccgagatgg 1020 
atttcccagg cgttcgcagg aggtgtgaag tgttctggag aagaactgga aaccgaggca 1080 
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gggctacatc ttcatctcaa ttacacgccc 
tgcgccttca gtgaaatcag tgcacgcaag 
tccctggcgg atgctgccaa tgaagccaag 
atccgcacac ctctgtacgg catgcttggc 
agtcggcagc aggccggtta cctaaaggca 
ctgatcagcg atgtgcttga cgtatccaag 
gtggaattct ccccgctgga attgaccgaa 
caggccaagg ggctgcagtt gtatacctgc 
ggggccgcgg cgtcgatccg gcagattctc 
accgacaatg gctatgtcaa cgtccacctg 
atgctgacct ggcaggtcaa cgataccggc 
ctgttcgaac cgttctacca gatacgccgc 
ggcttgtcga tcagccagcg cctggcgcag 
gagctggggt tgggcagcag ctttagcctc 
gctgagccgc aggacctagc cgggtgcgcc 
acggaatgcc tgtgtggctg gatctcccgc 
aggtcgctgg acgaggcgga cgcgacctcg 
gcgccgatgt tcgaagcatg gccaggatgc 
gagccgcagg cacagggccg cgactggctg 
cgtgctctgg gcctggccca tgggcgtctc 
gctccgttgc gcaatctagg tctccgcgtc 
ttgatcttga gggaccagat ggaagcgctg 
cgcgaggcgt tgctgcactg ccagacggcc 
atgccgaaca tgaacggata cgagctaacc 
ccgatcatcg gcgcgacggc gaacgccatg 
gggatgaacg attgcctggt caaaccggtg 
aatattctca aggtggatcg atga 

<210> 12 
<211> 2625 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 12 

atgaatgtgg ctatgtacga agcgcagctt 
cacttgagcg gcaatgtcgt gcccttggcc 
aacaatgtga gcatcctgcc gttgagtgac 
acgctcggtg atctccggga aaagcggctg 
ggccctctgg tttaccggct taccgccgat 
ataaccaaag' aggtgtaccg agccttgctg 
actgacggtg gtacccctca acggctgtac 
gaggggtggc taggcctgga gattctcggc 
gatgccggaa actacatgct gctggatcag 
gaggcgctgg ggagcggtgc gtcgcggacg 
ggtgctggcc cactgccgca gcatatggtg 
gatctgatct atcacatcgg tatcggtcgc 
cttgcctctg cgttggcact cgcagtcggc 
gagcgacgct tgatagagcc cgcaaagcga 
ttttcccgtg cagttatcca ggccgcgccc 
gccgcagtgg tcctggaaaa tccccaggcg 
gcccacgacg cgccgagatg gatttcccag 
gaagaactgg aaaccgaggc agggctacat 
aacggtgaag acgtattgtt ctgcgccttc 
gcggaactgg ctcgcgcaaa atccctggcg 
ctcgccacca tgagccatga aatccgcaca 
ctgcttgggc gtaccgagct gagtcggcag 
tcctcgtcga ccctgctgca actgatcagc 
ggccaactgg acctagagtg cgtggaattc 
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acccgctata acggtgaaga cgtattgttc 1140 
cggatggagg cggaactggc tcgcgcaaaa 1200 
acgctgtttc tcgccaccat gagccatgaa 12 60 
acgcttgagc tgcttgggcg taccgagctg 1320 
atccagcatt cctcgtcgac cctgctgcaa 1380 
atagaggccg gccaactgga cctagagtgc 1440 
gaggtcgtgc agtcgttcac cggtgccgcg 1500 
ctctctgcgg agctgccgct gcgcatgcgg 1560 
aacaacctgc tgagcaacgc ggtgaagttc 1620 
aaggccagcg tggtcgatgc cgaatgtgtg 1680 
atggggatca acgtcgagga tcagccgcgt 1740 
tccgagcatc cggtcgcagg cacgggcctc 1800 
ctaatgaatg gcagtctgaa actggtcagt 1860 
aggcttccgc ttgagcggat cgcgatgcag 1920 
gtccaagtgc tggcgcctgt ccgcgaccta 1980 
tggggtggaa gggccatggt cgcgacgccg 2 040 
ctgctggtcg aagtgttact gctggagggg 2100 
cgggtggagc tttcccctca gggtgatatg 2160 
ctcgggctca acaacctgga cggcctgcat 2220 
gctgatcctt cgacgccgcc gatacggctg 2280 
ctagtggtgg aggataacgc gatcaaccag 234 0 
ggctgcagcg tggagctgct cttcgatggt 2400 
tgcttcgacg tggtgctcac cgatatcaac 2460 
gcggagctac ggcgccaagg gttccggcag 2520 
cgtgaggagc gcgagcgctg catgtccgcc 2580 
gatctgaatg cccttcagaa ctgcttgatt 2640 

. 2664 



tatttcgagc agcgcgaggc gttgctcaat 60 
gcgggtagag cgctcgtcaa cgaagcgccg 120 
ggagggcgag gtctgctatt gaccgctcgc 180 
gcactgatgt atctggtcga taccgacaaa 240 
ggtaggccct cggcagcgat atccagcacg 300. 
gcgactccgt cggcgcctgt tcactgggtg 360 
ctttttgaat ccttaggcga tgagccgggc 420 
gaagacctcg attcgatgtt gcgccggaat 480 
catgggcagg tcgtactcgc tacggacgca 540 
cttttgcgtg gagacggctt cggtttcatc 600 
cttttccagc acgtggggtc ttcgagctgg 660 
ctgttgctgg ctctgtggct ccctctgtta 720 
atcctactgc attggctggt gcggagcatc 780 
cgccttgaag cattgaagga gagcgaagcc 840 
gtcgcgctgt gcgtgctgcg tcgtgccgac 900 
cgccaatggc tgggtgatag cgaggcgatt 960 
gcgttcgcag gaggtgtgaa gtgttctgga 1020 
cttcatctca attacacgcc cacccgctat 1080 
agtgaaatca gtgcacgcaa gcggatggag 1140 
gatgctgcca atgaagccaa gacgctgttt 1200 
cctctgtacg gcatgcttgg cacgcttgag 1260 
caggccggtt acctaaaggc aatccagcat 1320 
gatgtgcttg acgtatccaa gatagaggcc 13 80 
tccccgctgg aattgaccga agaggtcgtg 1440 
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cagtcgttca ccggtgccgc gcaggccaag gggctgcagt tgtatacctg cctctctgcg 1500 
gagctgccgc tgcgcatgcg gggggccgcg gcgtcgatcc ggcagattct caacaacctg 1560 
ctgagcaacg cggtgaagtt caccgacaat ggctatgtca acgtccacct gaaggccagc 1620 
gtggtcgatg ccgaatgtgt gatgctgacc tggcaggtca acgataccgg catggggatc 1680 
aacgtcgagg atcagccgcg tctgttcgaa ccgttctacc agatacgccg ctccgagcat 1740 
ccggtcgcag gcacgggcct cggcttgtcg atcagccagc gcctggcgca gctaatgaat 1800 
ggcagtctga aactggtcag tgagctgggg ttgggcagca gctttagcct caggcttccg 1860 
cttgagcgga tcgcgatgca ggctgagccg caggacctag ccgggtgcgc cgtccaagtg 1920 
ctggcgcctg tccgcgacct aacggaatgc ctgtgtggct ggatctcccg ctggggtgga 1980 
agggccatgg tcgcgacgcc gaggtcgctg gacgaggcgg acgcgacctc gctgctggtc 2040 
gaagtgttac tgctggaggg ggcgccgatg ttcgaagcat ggccaggatg ccgggtggag 2100 
ctttcccctc agggtgatat ggagccgcag gcacagggcc gcgactggct gctcgggctc 2160 
aacaacctgg acggcctgca tcgtgctctg ggcctggccc atgggcgtct cgctgatcct 2220 
tcgacgccgc cgatacggct ggctccgttg cgcaatctag gtctccgcgt cctagtggtg 2280 
gaggataacg cgatcaacca gttgatcttg agggaccaga tggaagcgct gggctgcagc 2340 
gtggagctgc tcttcgatgg tcgcgaggcg ttgctgcact gccagacggc ctgcttcgac 2400 
gtggtgctca ccgatatcaa catgccgaac atgaacggat acgagctaac cgcggagcta 2460 
cggcgccaag ggttccggca gccgatcatc ggcgcgacgg cgaacgccat gcgtgaggag 2520 
cgcgagcgct gcatgtccgc cgggatgaac gattgcctgg tcaaaccggt ggatctgaat 258 0 
gcccttcaga actgcttgat taatattctc aaggtggatc gatga 2625 

<210> 13 
<211> 2613 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 13 

atgtacgaag cgcagcttta tttcgagcag cgcgaggcgt tgctcaatca cttgagcggc 60 
aatgtcgtgc ccttggccgc gggtagagcg ctcgtcaacg aagcgccgaa caatgtgagc 120 
atcctgccgt tgagtgacgg agggcgaggt ctgctattga ccgctcgcac gctcggtgat 180 
ctccgggaaa agcggctggc actgatgtat ctggtcgata ccgacaaagg ccctctggtt 240 
taccggctta ccgccgatgg taggccctcg gcagcgatat ccagcacgat aaccaaagag 300 
gtgtaccgag ccttgctggc gactccgtcg gcgcctgttc actgggtgac tgacggtggt 360 
acccctcaac ggctgtacct ttttgaatcc ttaggcgatg agccgggcga ggggtggcta 420 
ggcctggaga ttctcggcga agacctcgat tcgatgttgc gccggaatga tgccggaaac 480 
tacatgctgc tggatcagca tgggcaggtc gtactcgcta cggacgcaga ggcgctgggg 540 
agcggtgcgt cgcggacgct tttgcgtgga gacggcttcg gtttcatcgg tgctggccca 600 
ctgccgcagc atatggtgct tttccagcac gtggggtctt cgagctggga tctgatctat 660 
cacatcggta tcggtcgcct gttgctggct ctgtggctcc ctctgttact tgcctctgcg 720 
ttggcactcg cagtcggcat cctactgcat tggctggtgc ggagcatcga gcgacgcttg 780 
atagagcccg caaagcgacg cctt'gaagca ttgaaggaga gcgaagcctt ttcccgtgca 840 
gttatccagg ccgcgcccgf cgcgctgtgc gtgctgcgtc gtgccgacgc cgcagtggtc 900 
ctggaaaatc cccaggcgcg ccaatggctg ggtgatagcg aggcgattgc ccacgacgcg 960 
ccgagatgga tttcccaggc gttcgcagga ggtgtgaagt gttctggaga agaactggaa 1020 
accgaggcag ggctacatct tcatctcaat tacacgccca cccgctataa cggtgaagac 1080 
gtattgttct gcgccttcag tgaaatcagt gcacgcaagc ggatggaggc ggaactggct 1140 
cgcgcaaaat ccctggcgga tgctgccaat gaagccaaga cgctgtttct cgccaccatg 1200 
agccatgaaa tccgcacacc tctgtacggc atgcttggca cgcttgagct gcttgggcgt 12 60 
accgagctga gtcggcagca ggccggttac ctaaaggcaa tccagcattc ctcgtcgacc 1320 
ctgctgcaac tgatcagcga tgtgcttgac gtatccaaga tagaggccgg ccaactggac 1380 
ctagagtgcg tggaattctc cccgctggaa ttgaccgaag aggtcgtgca gtcgttcacc 1440 
ggtgccgcgc aggccaaggg gctgcagttg tatacctgcc tctctgcgga gctgccgctg 1500 
cgcatgcggg gggccgcggc gtcgatccgg cagattctca acaacctgct gagcaacgcg 15 60 
gtgaagttca ccgacaatgg ctatgtcaac gtccacctga aggccagcgt ggtcgatgcc 162 0 
gaatgtgtga tgctgacctg gcaggtcaac gataccggca tggggatcaa cgtcgaggat 1680 
cagccgcgtc tgttcgaacc gttctaccag atacgccgct ccgagcatcc ggtcgcaggc 1740 
acgggcctcg gcttgtcgat cagccagcgc ctggcgcagc taatgaatgg cagtctgaaa 1800 
ctggtcagtg agctggggtt gggcagcagc tttagcctca ggcttccgct tgagcggatc 1860 
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gcgatgcagg ctgagccgca ggacctagcc gggtgcgccg tccaagtgct ggcgcctgtc 1920 
cgcgacctaa cggaatgcct gtgtggctgg atctcccgct ggggtggaag ggccatggtc 1980 
gcgacgccga ggtcgctgga cgaggcggac gcgacctcgc tgctggtcga agtgttactg 2040 
ctggaggggg cgccgatgtt cgaagcatgg ccaggatgcc gggtggagct ttcccctcag 2100 
ggtgatatgg agccgcaggc acagggccgc gactggctgc tcgggctcaa caacctggac 2160 
ggcctgcatc gtgctctggg cctggcccat gggcgtctcg ctgatccttc gacgccgccg 2220 
atacggctgg ctccgttgcg caatctaggt ctccgcgtcc tagtggtgga ggataacgcg 2280 
atcaaccagt tgatcttgag ggaccagatg gaagcgctgg gctgcagcgt ggagctgctc 2340 
ttcgatggtc gcgaggcgtt gctgcactgc cagacggcct gcttcgacgt ggtgctcacc 2400 
gatatcaaca tgccgaacat gaacggatac gagctaaccg cggagctacg gcgccaaggg 2460 
ttccggcagc cgatcatcgg cgcgacggcg aacgccatgc gtgaggagcg cgagcgctgc 2520 
atgtccgccg ggatgaacga ttgcctggtc aaaccggtgg atctgaatgc ccttcagaac 2580 
tgcttgatta atattctcaa ggtggatcga tga 2613 

<210> 14 
<211> 2409 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 14 

atgtatctgg tcgataccga caaaggccct ctggtttacc ggcttaccgc cgatggtagg 60 
ccctcggcag cgatatccag cacgataacc aaagaggtgt accgagcctt gctggcgact 120 
ccgtcggcgc ctgttcactg ggtgactgac ggtggtaccc ctcaacggct gtaccttttt 180 
gaatccttag gcgatgagcc gggcgagggg tggctaggcc tggagattct cggcgaagac 240 
ctcgattcga tgttgcgccg gaatgatgcc ggaaactaca tgctgctgga tcagcatggg 30 0 
caggtcgtac tcgctacgga cgcagaggcg ctggggagcg gtgcgtcgcg gacgcttttg 360 
cgtggagacg gcttcggttt catcggtgct ggcccactgc cgcagcatat ggtgcttttc 420 
cagcacgtgg ggtcttcgag ctgggatctg atctatcaca tcggtatcgg tcgcctgttg 480 
ctggctctgt ggctccctct gttacttgcc tctgcgttgg cactcgcagt cggcatccta 540 
ctgcattggc tggtgcggag catcgagcga cgcttgatag agcccgcaaa gcgacgcctt 600 
gaagcattga aggagagcga agccttttcc cgtgcagtta tccaggccgc gcccgtcgcg 660 
ctgtgcgtgc tgcgtcgtgc cgacgccgca gtggtcctgg aaaatcccca ggcgcgccaa 720 
tggctgggtg atagcgaggc gattgcccac gacgcgccga gatggatttc ccaggcgttc 780 
gcaggaggtg tgaagtgttc tggagaagaa ctggaaaccg aggcagggct acatcttcat 840 
ctcaattaca cgcccacccg ctataacggt gaagacgtat tgttctgcgc cttcagtgaa 900 
atcagtgcac gcaagcggat ggaggcggaa ctggctcgcg caaaatccct ggcggatgct 960 
gccaatgaag ccaagacgct gtttctcgcc accatgagcc atgaaatccg cacacctctg 1020 
tacggcatgc ttggcacgct tgagctgctt gggcgtaccg agctgagtcg gcagcaggcc 1080 
ggttacctaa aggcaatcca gcattcctcg tcgaccctgc tgcaactgat cagcgatgtg 1140 
cttgacgtat ccaagataga ggccggccaa ctggacctag agtgcgtgga attctccccg 1200 
ctggaattga ccgaagaggt cgtgcagtcg ttcaccggtg ccgcgcaggc caaggggctg 1260 
cagttgtata cctgcctctc tgcggagctg ccgctgcgca tgcggggggc cgcggcgtcg 1320 
atccggcaga ttctcaacaa cctgctgagc aacgcggtga agttcaccga caatggctat 1380 
gtcaacgtcc acctgaaggc cagcgtggtc gatgccgaat gtgtgatgct gacctggcag 1440 
gtcaacgata ccggcatggg gatcaacgtc gaggatcagc cgcgtctgtt cgaaccgttc 1500 
taccagatac gccgctccga gcatccggtc gcaggcacgg gcctcggctt gtcgatcagc 1560 
cagcgcctgg cgcagctaat gaatggcagt ctgaaactgg tcagtgagct ggggttgggc 1620 
agcagcttta gcctcaggct tccgcttgag cggatcgcga tgcaggctga gccgcaggac 1680 
ctagccgggt gcgccgtcca agtgctggcg cctgtccgcg acctaacgga atgcctgtgt 1740 
ggctggatct cccgctgggg tggaagggcc 1 atggtcgcga cgccgaggtc gctggacgag 1800 
gcggacgcga cctcgctgct ggtcgaagtg ttactgctgg agggggcgcc gatgttcgaa 1860 
gcatggccag gatgccgggt ggagctttcc cctcagggtg atatggagcc gcaggcacag 1920 
ggccgcgact ggctgctcgg gctcaacaac ctggacggcc tgcatcgtgc tctgggcctg 1980 
gcccatgggc gtctcgctga tccttcgacg ccgccgatac ggctggctcc gttgcgcaat 204 0 
ctaggtctcc gcgtcctagt ggtggaggat aacgcgatca accagttgat cttgagggac 2100 
cagatggaag cgctgggctg cagcgtggag ctgctcttcg atggtcgcga ggcgttgctg 2160 
cactgccaga cggcctgctt cgacgtggtg ctcaccgata tcaacatgcc gaacatgaac 2220 
ggatacgagc taaccgcgga gctacggcgc caagggttcc ggcagccgat catcggcgcg 2280 
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acggcgaacg ccatgcgtga ggagcgcgag cgctgcatgt ccgccgggat gaacgattgc 2340 
ctggtcaaac cggtggatct gaatgccctt cagaactgct tgattaatat tctcaaggtg 2400 
gatcgatga 2409 

<210> 15 
<211> 2160 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 15 

atgttgcgcc ggaatgatgc cggaaactac atgctgctgg atcagcatgg gcaggtcgta 60 
ctcgctacgg acgcagaggc gctggggagc ggtgcgtcgc ggacgctttt gcgtggagac 120 
ggcttcggtt tcatcggtgc tggcccactg ccgcagcata tggtgctttt ccagcacgtg 180 
gggtcttcga gctgggatct gatctatcac atcggtatcg gtcgcctgtt gctggctctg 240 
tggctccctc tgttacttgc ctctgcgttg gcactcgcag tcggcatcct actgcattgg 300 
ctggtgcgga gcatcgagcg acgcttgata gagcccgcaa agcgacgcct tgaagcattg 360 
aaggagagcg aagccttttc ccgtgcagtt atccaggccg cgcccgtcgc gctgtgcgtg 420 
ctgcgtcgtg ccgacgccgc agtggtcctg gaaaatcccc aggcgcgcca atggctgggt 480 
gatagcgagg cgattgccca cgacgcgccg agatggattt cccaggcgtt cgcaggaggt 540 
gtgaagtgtt ctggagaaga actggaaacc gaggcagggc tacatcttca tctcaattac 600 
acgcccaccc gctataacgg tgaagacgta ttgttctgcg ccttcagtga aatcagtgca 660 
cgcaagcgga tggaggcgga actggctcgc gcaaaatccc tggcggatgc tgccaatgaa 720 
gccaagacgc tgtttctcgc caccatgagc catgaaatcc gcacacctct gtacggcatg 780 
cttggcacgc ttgagctgct tgggcgtacc gagctgagtc ggcagcaggc cggttaccta 840 
aaggcaatcc agcattcctc gtcgaccctg ctgcaactga tcagcgatgt gcttgacgta 900 
tccaagatag aggccggcca actggaccta gagtgcgtgg aattctcccc gctggaattg 960 
accgaagagg tcgtgcagtc gttcaccggt gccgcgcagg ccaaggggct gcagttgtat 1020 
acctgcctct ctgcggagct gccgctgcgc atgcgggggg ccgcggcgtc gatccggcag 1080 
attctcaaca acctgctgag caacgcggtg aagttcaccg acaatggcta tgtcaacgtc 1140 
cacctgaagg ccagcgtggt cgatgccgaa tgtgtgatgc tgacctggca ggtcaacgat 1200 
accggcatgg ggatcaacgt cgaggatcag ccgcgtctgt tcgaaccgtt ctaccagata 1260 
cgccgctccg agcatccggt cgcaggcacg ggcctcggct tgtcgatcag ccagcgcctg 1320 
gcgcagctaa tgaatggcag tctgaaactg gtcagtgagc tggggttggg cagcagcttt 1380 
agcctcaggc ttccgcttga gcggatcgcg atgcaggctg agccgcagga cctagccggg 144 0 
tgcgccgtcc aagtgctggc gcctgtccgc gacctaacgg aatgcctgtg tggctggatc 1500 
tcccgctggg gtggaagggc catggtcgcg acgccgaggt cgctggacga ggcggacgcg 1560 
acctcgctgc tggtcgaagt gttactgctg gagggggcgc cgatgttcga agcatggcca 1620 
ggatgccggg tggagctttc ccctcagggt gatatggagc cgcaggcaca gggccgcgac 1680 
tggctgctcg ggctcaacaa cctggacggc ctgcatcgtg ctctgggcct ggcccatggg 1740 
cgtctcgctg atccttcgac gccgccgata cggctggctc cgttgcgcaa tctaggtctc 1800 
cgcgtcctag tggtggagga taacgcgatc aaccagttga tcttgaggga ccagatggaa 1860 
gcgctgggct gcagcgtgga gctgctcttc gatggtcgcg aggcgttgct gcactgccag 1920 
acggcctgct tcgacgtggt gctcaccgat atcaacatgc cgaacatgaa cggatacgag 1980 
ctaaccgcgg agctacggcg ccaagggttc cggcagccga tcatcggcgc gacggcgaac 2040 
gccatgcgtg aggagcgcga gcgctgcatg tccgccggga tgaacgattg cctggtcaaa 2100 
ccggtggatc tgaatgccct tcagaactgc ttgattaata ttctcaaggt ggatcgatga 2160 



<210> 16 
<211> 2130 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 16 

atgctgctgg atcagcatgg gcaggtcgta ctcgctacgg acgcagaggc gctggggagc 60 
ggtgcgtcgc ggacgctttt gcgtggagac ggcttcggtt tcatcggtgc tggcccactg 120 
ccgcagcata tggtgctttt ccagcacgtg gggtcttcga gctgggatct gatctatcac 180 
atcggtatcg gtcgcctgtt gctggctctg tggctccctc tgttacttgc ctctgcgttg 24 0 
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gcactcgcag tcggcatcct actgcattgg ctggtgcgga gcatcgagcg acgcttgata 300 
gagcccgcaa agcgacgcct tgaagcattg aaggagagcg aagccttttc ccgtgcagtt 360 
atccaggccg cgcccgtcgc gctgtgcgtg ctgcgtcgtg ccgacgccgc agtggtcctg 420 
gaaaatcccc aggcgcgcca atggctgggt gatagcgagg cgattgccca cgacgcgccg 480 
agatggattt cccaggcgtt cgcaggaggt gtgaagtgtt ctggagaaga actggaaacc 540 
gaggcagggc tacatcttca tctcaattac acgcccaccc gctataacgg tgaagacgta 600 
ttgttctgcg ccttcagtga aatcagtgca cgcaagcgga tggaggcgga actggctcgc 660 
gcaaaatccc tggcggatgc tgccaatgaa gccaagacgc tgtttctcgc caccatgagc 720 
catgaaatcc gcacacctct gtacggcatg cttggcacgc ttgagctgct tgggcgtacc 780 
gagctgagtc ggcagcaggc cggttaccta aaggcaatcc agcattcctc gtcgaccctg 840 
ctgcaactga tcagcgatgt gcttgacgta tccaagatag aggccggcca actggaccta 900 
gagtgcgtgg aattctcccc gctggaattg accgaagagg tcgtgcagtc gttcaccggt 960 
gccgcgcagg ccaaggggct gcagttgtat acctgcctct ctgcggagct gccgctgcgc 1020 
atgcgggggg ccgcggcgtc gatccggcag attctcaaca acctgctgag caacgcggtg 1080 
aagttcaccg acaatggcta tgtcaacgtc cacctgaagg ccagcgtggt cgatgccgaa 1140 
tgtgtgatgc tgacctggca ggtcaacgat accggcatgg ggatcaacgt cgaggatcag 1200 
ccgcgtctgt tcgaaccgtt ctaccagata cgccgctccg agcatccggt cgcaggcacg 1260 
ggcctcggct tgtcgatcag ccagcgcctg gcgcagctaa tgaatggcag tctgaaactg 1320 
gtcagtgagc tggggttggg cagcagcttt agcctcaggc ttccgcttga gcggatcgcg 1380 
atgcaggctg agccgcagga cctagccggg tgcgccgtcc aagtgctggc gcctgtccgc 1440 
gacctaacgg aatgcctgtg tggctggatc tcccgctggg gtggaagggc catggtcgcg 1500 
acgccgaggt cgctggacga ggcggacgcg acctcgctgc tggtcgaagt gttactgctg 1560 
gagggggcgc cgatgttcga agcatggcca ggatgccggg tggagctttc ccctcagggt 1620 
gatatggagc cgcaggcaca gggccgcgac tggctgctcg ggctcaacaa cctggacggc 1680 
ctgcatcgtg ctctgggcct ggcccatggg cgtctcgctg atccttcgac gccgccgata 1740 
cggctggctc cgttgcgcaa tctaggtctc cgcgtcctag tggtggagga taacgcgatc 1800 
aaccagttga tcttgaggga ccagatggaa gcgctgggct gcagcgtgga gctgctcttc 1860 
gatggtcgcg aggcgttgct gcactgccag acggcctgct tcgacgtggt gctcaccgat 1920 
atcaacatgc cgaacatgaa cggatacgag ctaaccgcgg agctacggcg ccaagggttc 1980 
cggcagccga tcatcggcgc gacggcgaac gccatgcgtg aggagcgcga gcgctgcatg 204 0 
tccgccggga tgaacgattg cctggtcaaa ccggtggatc tgaatgccct tcagaactgc 2100 
ttgattaata ttctcaaggt ggatcgatga 2130 

<210> 17 
<211> 2001 
<212> DNA 

<213> Pseudimonas aeruginosa PA14 
<400> 17 

atggtgcttt tccagcacgt ggggtcttcg agctgggatc tgatctatca catcggtatc 60 

ggtcgcctgt tgctggctct gtggctccct ctgttacttg cctctgcgtt ggcactcgca 120 

gtcggcatcc tactgcattg gctggtgcgg agcatcgagc gacgcttgat agagcccgca 180 

aagcgacgcc ttgaagcatt gaaggagagc gaagcctttt cccgtgcagt tatccaggcc 240 

gcgcccgtcg cgctgtgcgt gctgcgtcgt gccgacgccg cagtggtcct ggaaaatccc 300 

caggcgcgcc aatggctggg tgatagcgag gcgattgccc acgacgcgcc gagatggatt 360 

tcccaggcgt tcgcaggagg tgtgaagtgt tctggagaag aactggaaac cgaggcaggg 420 

ctacatcttc atctcaatta cacgcccacc cgctataacg gtgaagacgt attgttctgc 480 

gccttcagtg aaatcagtgc acgcaagcgg atggaggcgg aactggctcg cgcaaaatcc 540 

ctggcggatg ctgccaatga agccaagacg ctgtttctcg ccaccatgag ccatgaaatc 600 

cgcacacctc tgtacggcat gcttggcacg cttgagctgc ttgggcgtac cgagctgagt 660 

cggcagcagg ccggttacct aaaggcaatc cagcattcct cgtcgaccct gctgcaactg 720 

atcagcgatg tgcttgacgt atccaagata gaggccggcc aactggacct agagtgcgtg 780 

gaattctccc cgctggaatt gaccgaagag gtcgtgcagt cgttcaccgg tgccgcgcag 840 

gccaaggggc tgcagttgta tacctgcctc tctgcggagc tgccgctgcg catgcggggg 900 

gccgcggcgt cgatccggca gattctcaac aacctgctga gcaacgcggt gaagttcacc 960 

gacaatggct atgtcaacgt ccacctgaag gccagcgtgg tcgatgccga atgtgtgatg 102 0 

ctgacctggc aggtcaacga taccggcatg gggatcaacg tcgaggatca gccgcgtctg 1080 

ttcgaaccgt tctaccagat acgccgctcc gagcatccgg tcgcaggcac gggcctcggc 1140 
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ttgtcgatca gccagcgcct ggcgcagcta atgaatggca gtctgaaact ggtcagtgag 1200 
ctggggttgg gcagcagctt tagcctcagg cttccgcttg agcggatcgc gatgcaggct 1260 
gagccgcagg acctagccgg gtgcgccgtc caagtgctgg cgcctgtccg cgacctaacg 1320 
gaatgcctgt gtggctggat ctcccgctgg ggtggaaggg ccatggtcgc gacgccgagg 1380 
tcgctggacg aggcggacgc gacctcgctg ctggtcgaag tgttactgct ggagggggcg 1440 
ccgatgttcg aagcatggcc aggatgccgg gtggagcttt cccctcaggg tgatatggag 1500 
ccgcaggcac agggccgcga ctggctgctc gggctcaaca acctggacgg cctgcatcgt 1560 
gctctgggcc tggcccatgg gcgtctcgct gatccttcga cgccgccgat acggctggct 1620 
ccgttgcgca atctaggtcf ccgcgtccta gtggtggagg ataacgcgat caaccagttg 1680 
atcttgaggg accagatgga agcgctgggc tgcagcgtgg agctgctctt cgatggtcgc 1740 
gaggcgttgc tgcactgcca gacggcctgc ttcgacgtgg tgctcaccga tatcaacatg 1800 
ccgaacatga acggatacga gctaaccgcg gagctacggc gccaagggtt ccggcagccg 1860 
atcatcggcg cgacggcgaa cgccatgcgt gaggagcgcg agcgctgcat gtccgccggg 1920 
atgaacgatt gcctggtcaa accggtggat ctgaatgccc ttcagaactg cttgattaat 1980 
attctcaagg tggatcgatg a 2001 

<210> 18 
<211> 1491 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 18 

atggaggcgg aactggctcg cgcaaaatcc ctggcggatg ctgccaatga agccaagacg 60 
ctgtttctcg ccaccatgag ocatgaaatc cgcacacctc tgtacggcat gcttggcacg 120 
cttgagctgc ttgggcgtac cgagctgagt cggcagcagg ccggttacct aaaggcaatc 180 
cagcattcct cgtcgaccct gctgcaactg atcagcgatg tgcttgacgt atccaagata 240 
gaggccggcc aactggacct agagtgcgtg gaattctccc cgctggaatt gaccgaagag 300 
gtcgtgcagt cgttcaccgg tgccgcgcag gccaaggggc tgcagttgta tacctgcctc 360 
tctgcggagc tgccgctgcg catgcggggg gccgcggcgt cgatccggca gattctcaac 420 
aacctgctga gcaacgcggt gaagttcacc gacaatggct atgtcaacgt ccacctgaag 480 
gccagcgtgg tcgatgccga atgtgtgatg ctgacctggc aggtcaacga taccggcatg 540 
gggatcaacg tcgaggatca gccgcgtctg ttcgaaccgt tctaccagat acgccgctcc 600 
gagcatccgg tcgcaggcac gggcctcggc ttgtcgatca gccagcgcct ggcgcagcta 660 
atgaatggca gtctgaaact ggtcagtgag ctggggttgg gcagcagctt tagcctcagg 720 
cttccgcttg agcggatcgc gatgcaggct gagccgcagg acctagccgg gtgcgccgtc 780 
caagtgctgg cgcctgtccg cgacctaacg gaatgcctgt gtggctggat ctcccgctgg 840 
ggtggaaggg ccatggtcgc gacgccgagg tcgctggacg aggcggacgc gacctcgctg 900 
ctggtcgaag tgttactgct ggagggggcg ccgatgttcg aagcatggcc aggatgccgg 960 
gtggagcttt cccctcaggg tgatatggag ccgcaggcac agggccgcga ctggctgctc 1020 
gggctcaaca acctggacgg cctgcatcgt gctctgggcc tggcccatgg gcgtctcgct 1080 
gatccttcga cgccgccgat acggctggct ccgttgcgca atctaggtct ccgcgtccta 1140 
9 fc 99tggagg ataacgcgat caaccagttg atcttgaggg accagatgga agcgctgggc 1200 
tgcagcgtgg agctgctctt cgatggtcgc gaggcgttgc tgcactgcca gacggcctgc 1260 
ttcgacgtgg tgctcaccga tatcaacatg ccgaacatga acggatacga gctaaccgcg 1320 
gagctacggc gccaagggtt ccggcagccg atcatcggcg cgacggcgaa cgccatgcgt 1380 
gaggagcgcg agcgctgcat gtccgccggg atgaacgatt gcctggtcaa accggtggat 1440 
ctgaatgccc ttcagaactg cttgattaat attctcaagg tggatcgatg a 14 91 

<210> 19 
<211> 931 
<212> PRT 

<213> Pseudomonas aeruginosa PA14 
<400> 19 

Met Lys Leu Lys Asn Phe Leu Gin Pro Phe Asp Ser Gly Phe Ser Thr 

1 5 10 15 

Pro Ser Ala Ala Leu Lys Leu Leu Arg Met Leu Gly Gly Ala Leu Met 
20 25 30 
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Leu 


Cys 


Val 


Leu 


Cys 


Ser 


Leu 


He 






35 










40 


His 


Gin 


Val 


Ser 


Leu 


Ser 


Arg 


Gin 




50 










55 




Ala 


Gin 


Leu 


Tyr 


Phe 


Glu 


Gin 


Arg 


65 










70 






Gly 


Asn 


Val 


Val 


Pro 


Leu 


Ala 


Ala 








85 








Pro 


Asn 


Asn 


Val 


Ser 


He 


Leu 


Pro 








100 










Leu 


Leu 


Thr 


Ala 


Arg 


Thr 


Leu 


Gly 






115 










120 


Leu 


Met 


Tyr 


Leu 


Val 


Asp 


Thr 


Asp 




130 










135 




Thr 


Ala 


Asp 


Gly 


Arg 


Pro 


Ser 


Ala 


145 










150 






Glu 


Val 


Tyr 


Arg 


Ala 


Leu 


Leu 


Ala 










165 








Val 


Thr 


Asp 


Gly 


Gly 


Thr 


Pro 


Gin 








180 










Gly 


Asp 


Glu 


Pro 


Gly 


Glu 


Gly 


Trp 






195 










200 


Asp 


Leu 


Asp 


Ser 


Met 


Leu 


Arg 


Arg 




210 










215 




Leu 


Asp 


Gin 


His 


Gly 


Gin 


Val 


Val 


225 










230 






Gly 


Ser 


Gly 


Ala 


Ser 


Arg 


Thr 


Leu 










245 








He 


Gly 


Ala 


Gly 


Pro 


Leu 


Pro 


Gin 








260 










Gly 


Ser 


Ser 


Ser 


Trp 


Asp 


Leu 


He 






275 










280 


Leu 


Leu 


Ala 


Leu 


Trp 


Leu 


Pro 


Leu 




290 










295 




Ala 


Val 


Gly 


He 


Leu 


Leu 


His 


Trp 


305 










310 






Leu 


He 


Glu 


Pro 


Ala 


Lys 


Arg 


Arg 










325 








Ala 


Phe 


Ser 


Arg 


Ala 


Val 


He 


Gin 








340 










Leu 


Arg 


Arg 


Ala 


Asp 


Ala 


Ala 


Val 






355 


Gly 








360 


Gin 


Trp 


Leu 


Asp 


Ser 


Glu 


Ala 




370 










375 




He 


Ser 


Gin 


Ala 


Phe 


Ala 


Gly 


Gly 


385 










390 






Glu 


Thr 


Glu 


Ala 


Gly 


Leu 


His 


Leu 










405 








Tyr 


Asn 


Gly 


Glu 


Asp 


Val 


Leu 


Phe 








420 










Arg 


Lys 


Arg 


Met 


Glu 


Ala 


Glu 


Leu 






435 










440 


Ala 


Ala 


Asn 


Glu 


Ala 


Lys 


Thr 


Leu 




450 










455 




He 


Arg 


Thr 


Pro 


Leu 


Tyr 


Gly 


Met 


465 










470 






Arg 


Thr 


Glu 


Leu 


Ser 


Arg 


Gin 


Gin 










485 









pne 


Ser 


vax 


Ser 


jyieu 
45 


vax 


Leu 


Asn 


Ala 


Met 


Asn 


Val 
60 


Ala 


Met 


Tyr 


Glu 


Glu 


Ala 


Leu 
75 


Leu 


Asn 


His 


Leu 


Ser 
80 


Gly 


Arg 
90 


Ala 


Leu 


Val 


Asn 


Glu 
95 


Ala 


Leu 


Ser 


Asp 


Gly 


Gly 


Arg 


Gly 


Leu 


105 










110 






Asp 


Leu 


Arg 


Glu 


Lys 
125 


Arg 


Leu 


Ala 


Lys 


Gly 


Pro 


Leu 
140 


Val 


Tyr 


Arg 


Leu 


Ala 


He 


Ser 
155 


Ser 


Thr 


He 


Thr 


Lys 
160 


Thr 


Pro 
170 


Ser 


Ala 


Pro 


Val 


His 
175 


Trp 


Arg 


Leu 


Tyr 


Leu 


Phe 


Glu 


Ser 


Leu 


185 










190 






Leu 


Gly 


Leu 


Glu 


lie 
205 


Leu 


Gly 


Glu 


Asn 


Asp 


Ala 


Gly 
220 


Asn 


Tyr 


Met 


Leu 


Leu 


Ala 


Thr 
235 


Asp 


Ala 


Glu 


Ala 


Leu 
240 


Leu 


Arg 
250 


Gly 


Asp 


Gly 


Phe 


Gly 
255 


Phe 


His 


Met 


Val 


Leu 


Phe 


Gin 


His 


Val 


265 










270 






Tyr 


His 


lie 


Gly 


He 
285 


Gly 


Arg 


Leu 


Leu 


Leu 


Ala 


Ser 
300 


Ala 


Leu 


Ala 


Leu 


Leu 


Val 


Arg 
315 


Ser 


He 


Glu 


Arg 


Arg 
320 


Leu 


Glu 
330 


Ala 


Leu 


Lys 


Glu 


Ser 
335 


Glu 


Ala 


Ala 


Pro 


Val 


Ala 


Leu 


Cys 


Val 


345 










350 






Val 


Leu 


Glu 


Asn 


Pro 
365 


Gin 


Ala 


Arg 


He 


Ala 


His 


Asp 
380 


Ala 


Pro 


Arg 


Trp 


Val 


Lys 


Cys 
395 


Ser 


Gly 


Glu 


Glu 


Leu 
400 


His 


Leu 
410 


Asn 


Tyr 


Thr 


Pro 


Thr 
415 


Arg 


Cys 


Ala 


Phe 


Ser 


Glu 


He 


Ser 


Ala 


425 










430 






Ala 


Arg 


Ala 


Lys 


Ser 
445 


Leu 


Ala 


Asp 


Phe 


Leu 


Ala 


Thr 
460 


Met 


Ser 


His 


Glu 


Leu 


Gly 


Thr 
475 


Leu 


Glu 


Leu 


Leu 


Gly 
480 


Ala 


Gly 
490 


Tyr 


Leu 


Lys 


Ala 


He 
495 


Gin 
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His Ser Ser Ser Thr Leu 


Leu Gin Leu 


He 


Ser Asp Val 


Leu 


Asp 


Val 


500 


505 






510 




Ser Lys He Glu Ala Gly 


Gin Leu Asp 


Leu 


Glu Cys Val 


Glu 


Phe 


Ser 


515 


520 




525 








Pro Leu Glu Leu Thr Glu 


Glu Val Val 


Gin 


Ser Phe Thr 


Glv 


Ala 


Ala 


530 


535 




540 






Gin Ala Lys Gly Leu Gin 


Leu Tyr Thr 


Cvs 


Leu Ser Ala 


Glu 


Leu 


Pro 


545 550 






555 








Leu Arg Met Arg Gly Ala 


Ala Ala Ser 


He 


Ara Gin Tie 
wAAJi x-i.^ 


Leu 


7\ cm 
noil 


-Moll 


565 




570 










Leu Leu Ser Asn Ala Val 


Lys Phe Thr 


Asd 


Asn Glv Tvr 


Val 


Asn 


Val 
v ax 


580 


585 






590 






His Leu Lys Ala Ser Val 


Val Asp Ala 


Glu 


Cys Val Met 


Leu 


Thr 


lip 


595 


600 




U v J 






Gin Val Asn Asp Thr Gly 


Met Gly He 


Asn 


Val Glu Asn 

v»j. uxu rioy 


Gin 


Pro 




610 


615 




620 






Leu Phe Glu Pro Phe Tyr 


Gin He Arg 


Arcr 


Ser Glu His 


Pro 


Val 


Ala 


625 630 






635 






640 


Gly Thr Gly Leu Gly. Leu 


Ser He Ser 


Gin 


Ara Leu Ala 


Gin 


Leu 


Met 


645 




650 






5R 

D D ZJ 




Asn Gly Ser Leu Lys Leu 


Val Ser Glu 


Leu 


Glv Leu Gl v 


Ser 

OCX 


Ser 

OCX 


Phe 

tr lie 


660 


665 




D / U 






Ser Leu Ara Leu Pro Leu 


Glu Ara He 


Ala 


Met Gin Ala 

ncu vxii Axa 


Glu 


Pro 


Gin 


675 






foe 








Asp Leu Ala Gly Cys Ala 


Val Gin Val 

»Mi WiLll VUX 


Leu 


Ala Pro Val 


Ara 


Asp 


Leu 


690 


695 




700 








Thr Glu Cys Leu Cys Gly 


Trp He Ser 


Ara 


Tro Glv Glv 


Arg 


Ala 


Met 


705 710 






715 






72 0 


Val Ala Thr Pro Arg Ser 


Leu Asp Glu 


Ala 


Asp Ala Thr 


Ser 


Leu 


Leu 


725 




730 






735 




Val Glu Val Leu Leu Leu 


Glu Gly Ala 


Pro 


Met Phe Glu 


Ala 


Trrj 
xx y 


Pro 


74 0 


745 






750 






Gly Cys Arg Val Glu Leu 


Ser Pro Gin 


Glv 


Asp Met Glu 


Pro 


Gin 


Ala 


755 


760 












Gin Glv Ara Asd Tro Leu 


Leu Glv Leu 


Asn 


Asn Leu Asn 

aj^»u no^ 


Glv 


Leu 


His 


770 


775 




780 








Ara Ala Leu Glv Leu Ala 


His Glv Ara 


Leu 


AT a Asr> Prn 


Ser 


Thr 


Pro 


785 790 






795 






flOO 


Pro He Ara Leu Ala Pro 


Leu Ara Asn 


Leu 


Glv Leu Ara 


Val 


Leu 


Val 


805 




RIO 

O X v 






Oi J 




Val Glu Asp Asn Ala lie 


Asn Gin Leu 


He 


Leu Ara Asn 


Gin 


Met 


Glu 


820 


825 




830 






Ala Leu Gly Cys Ser Val 


Glu Leu Leu 


Phe 


z\cn Glv Ara 


Glu 


Ala 


Leu 


835 


840 




O *± 3 








Leu His Cvs Gin Thr Ala 


CVS Phe A q-r\ 

>«»jr 9 IT JL AG nop 


Val 

v ax 


Va T Tien Thr 
vai ucu ± i -li- 


Ann 

nay/ 


Tie 

X xc 




850 


855 




ft £ n 








Met Pro Asn Met Asn Gly 


Tyr Glu Leu 


Thr 


Ala Glu Leu 


Arg 


Arg 


Gin 


865 870 






875 






880 


Gly Phe Arg Gin Pro He 


He Gly Ala 


Thr 


Ala Asn Ala 


Met 


Arg 


Glu 


885 




890 






895 




Glu Arg Glu Arg Cys Met 


Ser Ala Gly 


Met 


Asn Asp Cys 


Leu 


Val 


Lys 


900 


905 






910 






Pro Val Asp Leu Asn Ala 


Leu Gin Asn 


Cys 


Leu He Asn 


He 


Leu 


Lys 


915 


920 




925 









Val Asp Arg 
930 
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<210> 20 
<211> 906 
<212> PRT 

<213> Pseudomonas aeruginosa PA14 



<400> 20 





Leu 




vjiy 


Ala 


Leu 


Mot* 

roe t. 


Leu 


L.ys vai 


jjcu uys oer lieu ±±e 


rrne 


1 








b 








10 


1 c 
lb 




Ser 


vai 


Ser 




Val 


Leu 


Asn 


HIS 


p1 n T7~ 1 

bin vai 


ocl IjcU be! AJ_y Vjin 


Aia 








20 










25 


J 0 




Met- 


Asn 


vai 


Ala 

Ala. 


net 


Tyr 


ulU 


Aia 


Gin Leu 


• iS Tl Vi cs. ^51 ii P. 1 *-» 7\ V/*"t 

iyr irne uiu bin Axg 


pi 
ulu 






35 










A A 

40 




A C 




al =a 

Aid 


Leu 


Leu 


Asn 


XllS 


Leu 


Ser 


my 


Asn vai 


Val irxO lieu Ala Ala 


Pi . . 

ijiy 




50 










55 






C A 

oO 




Arg 


Ala 


Leu 


Val 


Asn 


pi 

blU 


Ala 


Pro 


Asn Asn 


Val ocj. lie JjclL iriO 


Leu 


65 










70 








75 


O A 

80 


oer 


A an 

ASp 


Gly 


Giy 


Arg 


pi - , 

Giy 


Leu 


Leu 


Leu Thr 


Aia Arg inr i>eu Giy 


Asp 










85 








90 


95 




Leu 


Arg 


Glu 


Lys 


Arg 


Leu 


Ala 


Leu 


Met Tyr 


Leu Val Asp Thr Asp 


Lys 








100 










105 


i i n 
110 




Giy 


Pro 


Leu 


vai 


Tyr 


Arg 


Leu 


inr 


Ala Asp 


Lriy Arg fro oer Aia 


Aia 






115 










120 




IOC 

125 




lie 


Ser 


Ser 


Thr 


lie 


Thr 


Lys 


pi ii 
GlU 


Val Tyr 


Arg Ala Leu Leu Ala 


Thr 




130 










135 






1 A A 
140 




Pro 


Ser 


7V 1 — 

Ala 


Pro 


vai 


TT »! — 

lilS 


Trp 


TT-. 1 

vai 


Thr Asp 


Gly Gly Thr Pro Gin 


Arg 


145 










150 








155 


160 


Leu 


Tyr 


Leu 


Pne 


GlU 


Ser 


Leu 


Gly 


Asp Glu 


Pro Gly Glu Gly Trp 


Leu 










165 








170 


1 nr 

175 




Pi <»»■ 

Giy 


Leu 


G1U 


lie 


Leu 


Giy 


pi 

blU 


Asp 


Leu Asp 


oer He t lieu Arg Arg 


Asn 








180 










185 


190 




Asp 


Ala 


Gly 


Asn 


Tyr 


Met 


Leu 


Leu 


Asp Gin 


TT J _ Pi,, pi _ TT_ 1 TT_ 1 

His Gly Gin val vai 


Leu 






195 










200 




205 




Ala 


Tnr 


Asp 


Ala 


G1U 


7\ 1 

Ala 


Leu 


p i 
Giy 


Ser Gly 


Ala Ser Arg Thr Leu 


Leu 




210 










215 






220 




Arg 


Gly 


Asp 


Gly 


pne 


pi,. 
Gly 


Phe 


Tl - 

lie 


p i , _ > 1 _ 
Giy Aia 


Gly Pro Leu Pro Gin 


TT ' 

HIS 


225 










230 








235 


240 


Met 


TT — 1 

val. 


Leu 


Pne 


pi — 
Gin 


His 


TT- 1 

val 


Pi i r 

Gly 


Ser Ser 


Ser Trp Asp Leu He 


Tyr 










245 








250 


255 




His 


T 1 „ 

lie 


Gly 


lie 


pi 
Gly 


Arg 


Leu 


Leu 


Leu Ala 


Leu Trp Leu Pro Leu 


Leu 








260 










265 


270 




Leu 


Ala 


Ser 


7\ 1 — 

Ala 


T «i i 

Leu 


Ala 


Leu 


Aia 


t r _ i pi,. 

vai Giy 


He Leu Leu His Trp 


Leu 






275 










280 




285 




Val 


Arg 


Ser 


lie 


Glu 


Arg 


Arg 


Leu 


He Glu 


Pro Ala Lys Arg Arg 


Leu 




290 










295 






300 




Glu 


Ala 


Leu 


Lys 


Glu 


Ser 


Glu 


Ala 


Phe Ser 


Arg Ala Val He Gin 


Ala 


305 










310 








315 


320 


Ala 


Pro 


Val 


Ala 


Leu 




Val 


Leu 


Arg Arg 


Ala Asp Ala Ala Val 


Val 










325 








330 


335 




Leu 


Glu 


Asn 


Pro 


Gin 


Ala 


Arg 


Gin 


Trp Leu 


Gly Asp Ser Glu Ala 


He 








340 










345 


350 




Ala 


His 


Asp 


Ala 


Pro 


Arg 


Trp 


He 


Ser Gin 


Ala Phe Ala Gly Gly 


Val 






355 










360 




365 




Lys 


Cys 


Ser 


Gly 


Glu 


Glu 


Leu 


Glu 


Thr Glu 


Ala Gly Leu His Leu 


His 




370 










375 






380 




Leu 


Asn 


Tyr 


Thr 


Pro 


Thr 


Arg 


Tyr 


Asn Gly 


Glu Asp Val Leu Phe 


Cys 


385 










390 








395 


400 


Ala 


Phe 


Ser 


Glu 


He 


Ser 


Ala 


Arg 


Lys Arg 


Met Glu' Ala Glu Leu 
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80 


Glu 


Ala 


Gly 


Gin 


Leu 


Asp 


Leu 


Glu 


Cys 


Val 


Glu 


Phe 


Ser 


Pro 


Leu 


Glu 










85 










90 










95 




Leu 


Thr 


Glu 


Glu 


val 


Val 


Gin 


Ser 


Phe 


Thr 


Gly 


Ala 


Ala 


Gin 


Ala 


Lys 








100 










105 










110 






Gly 


Leu 


Gin 


Leu 


Tyr 


Thr 


Cys 


Leu 


Ser 


Ala 


Glu 


Leu 


Pro 


Leu 


Arg 


Met 






115 










120 










125 








Arg 


Gly 


Ala 


Ala 


Ala 


Ser 


He 


Arg 


Gin 


He 


Leu 


Asn 


Asn 


Leu 


Leu 


Ser 




130 










135 










140 










Asn 


Ala 


Val 


Lys 


Phe 


Thr 


Asp 


Asn 


Gly 


Tyr 


Val 


Asn 


Val 


His 


Leu 


Lys 


145 










150 










155 










160 


Ala 


Ser 


Val 


Val 


Asp 


Ala 


Glu 


Cys 


Val 


Met 


Leu 


Thr 


Trp 


Gin 


Val 


Asn 










165 










170 








175 




Asp 


Thr 


Gly 


Met 


Gly 


He 


Asn 


Val 


Glu 


Asp 


Gin 


Pro 


Arg 


Leu 


Phe 


Glu 








180 










185 










190 






Pro 


Phe 


Tyr 


Gin 


lie 


Arg 


Arg 


Ser 


Glu 


His 


Pro 


Val 


Ala 


Gly 


Thr 


Gly 






195 










200 










205 








Leu 


Gly 


Leu 


Ser 


He 


Ser 


Gin 


Arg 


Leu 


Ala 


Gin 


Leu 


Met 


Asn 


Gly 


Ser 




210 










215 










220 










Leu 


Lys 


Leu 


Val 


Ser 


Glu 


Leu 


Gly 


Leu 


Gly 


Ser 


Ser 


Phe 


Ser 


Leu 


Arg 


225 










230 










235 










240 


Leu 


Pro 


Leu 


Glu 


Arg 


He 


Ala 


Met 


Gin 


Ala 


Glu 


Pro 


Gin 


Asp 


Leu 


Ala 










245 










250 










255 




Gly 


Cys 


Ala 


Val 


Gin 


Val 


Leu 


Ala 


Pro 


Val 


Arg 


Asp 


Leu 


Thr 


Glu 


Cys 








260 










265 










270 






Leu 


Cys 


Gly 


Trp 


He 


Ser 


Arg 


Trp 


Gly 


Gly 


Arg 


Ala 


Met 


Val 


Ala 


Thr 






275 










280 










285 








Pro 


Arg 


Ser 


Leu 


Asp 


Glu 


Ala 


Asp 


Ala 


Thr 


Ser 


Leu 


Leu 


Val 


Glu 


Val 




290 










295 










300 










Leu 


Leu 


Leu 


Glu 


Gly 


Ala 


Pro 


Met 


Phe 


Glu 


Ala 


Trp 


Pro 


Gly 


Cys 


Arg 


305 










310 










315 










320 


Val 


Glu 


Leu 


Ser 


Pro 


Gin 


Gly 


Asp 


Met 


Glu 


Pro 


Gin 


Ala 


Gin 


Gly 


Arg 
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325 330 335 



Asp 


Trp 


Leu 


Jjcu vj-L y 


Leu 


Asn 


Asn 


Leu 


Asp 


(jiy 


Leu 


TT A r-i 
HIS 


Arg 


Ala 


Leu 








J fi U 


















*a c a 
3 50 






Hi ir 

vjiy 


Leu 


Til a 
Ala 


rllS vsiy 


Arg 


Leu 


Aia 


Asp 


Pro 


Ser 


mr 


Pro 


Pro 


He 


Arg 






one 
J 3D 








jDU 










*3 £ c: 
•job 






Leu 


Ala 


Pro 


Leu Arg 


Asn 


Leu 


/IT i r 

Giy 


Leu 


Arg 


val 


Leu 


.Val 


Val 


Glu 


Asp 




j /U 








o /b 










O O A 










Asn 


Aia. 


lie 


Asn Gin 


Leu 


lie 


Leu 


TV 

Arg 


Asp 


Gin 


Met 


blU 


Ala 


Leu 


Gly 
































Cys 


Ser 


vai 


Glu Leu 


Leu 


fne 


Asp 


C3iy 


Arg 


GlU 


Ala 


Leu 


Leu 


His 


Cys 








405 










410 










415 




Gin 


Thr 


Ala 


Cys Phe 


Asp 


Val 


Val 


Leu 


Thr 


Asp 


He 


Asn 


Met 


Pro 


Asn 








420 








425 










430 






Met 


Asn 


Gly 


Tyr Glu 


Leu 


Thr 


Ala 


Glu 


Leu 


Arg 


Arg 


Gin 


Gly 


Phe 


Arg 






435 








440 










445 








Gin 


Pro 


He 


He Gly 


Ala 


Thr 


Ala 


Asn 


Ala 


Met 


Arg 


Glu 


Glu 


Arg 


Glu 




450 








455 










460 










Arg 


Cys 


Met 


Ser Ala 


Gly 


Met 


Asn 


Asp 


Cys 


Leu 


Val 


Lys 


Pro 


Val 


Asp 


465 








470 










475 










.480 


Leu 


Asn 


Ala 


Leu Gin 


Asn 


Cys 


Leu 


He 


Asn 


He 


Leu 


Lys 


Val 


Asp 


Arg 



485 490 495 



<210> 30 
<211> 1992 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 30 

atggatgtta taegggagea tgaggtattt ettgggegea tegctcgaaa. aagcgacaag 60 
accacccaga agtacgacta tgacgtggtg cctttgcagc ggcacttgtt ggcaaaggaa 120 
aacggattag eggtctatga gggaegggag ttttcctttg etatgecatt tctactggct 180 
accaagcacg cgttgagcgc cgattcctcg ggagatccgt tttegctegg tgtattgetc 240 
gecaatttet aeggaagett ctggagtgtt tccgcctatc ccgcgccaca gttactgatc 30 0 
tttgatcttt ccggcagcac ccgcctggca gtgeegtega ttccctccac agegcagegt 360 
gacaggttga geggaagcta tccgatgata gtcgagegea ttctggcgcg cttgcgcacc 420 
cggccggtgg gggaggaege tcagcgtgtc cattggatac gegctgateg ctatcgcgac 480 
teggegctgg agatgttggg agtcgcccgg gttgatctgc eggaaacact ctggtggcac 54 0 
gaegagcega accatctgat catcgctgcg agectgettg atetcaggeg aatcaatgac 600 
ttcgaacagt tggttgagcg cccggcattc gattegtaca gectggtate geeggatgge 660 
gaggtattgc tcggcgcggc ccctgcgacc ggcctgaggg atggcctgaa cctcacccga 720 
cagggggtcg ccgttcaact gcgcagccag cctgagaacg gctggctcgc ggtctaccga 780 
accgactacg gcaatttctt tcgccactcc cggtggctgg tggcaggtct gctgctgacc 84 0 
ccggcgctgc tcctggccgg ttggctcggg atgcgttggt acaccagcag cgtcgtcaac 900 
ccggtgcatc gggcgcaccg gcaactggtg gagagegaca ccttcagccg gaegctgata 960 
cagaccgcgc cggtggctct ggtggtgctg acccaggatg accagcaact ggtgacctgc 1020 
aaccacttgg ccgcccagtg gctgggcggg cccacggaga tccttgggct gacttccaac 1080 
tggaagcttt tegatgegeg tgggcaggta ccaggagaca tctgtatcca ggtcggtggg 1140 
cgctatttgc agaccgcctt cgcggcgacc cgctatgccg gcaccgaggc ggtactgtgc 1200 
gtattcaacg acatcaeggt ccactgcgag gcggagaccg cgctgtccaa tgegaagega 1260 
geageggatg ccgccagcca ggccaagacc ctgttcctgg cccgcatgag ccatgaaatc 132 0 
cgtactcccc tgtacggtgt ccttggcacc ctggagttgc tcgacctgac caccctgaac 13 80 
gageggcaac gcgcctacct acgcaccatc cagagttcgt ctgcgacgct catgeaactg 1440 
attagegatg tgctggatgt ctcgaagatc gaagegggge agatggctct gaccctggcc 1500 
gccttcaatc cgctggacct agtgcgggaa gtgcttggca actttgccgc cagcgccatg 1560 
gccaaggacc tgcaggtaga cccgctcgat actcttgcgc ttgaggegea ggtcgegcat 1620 
ggcttcgaag aaagcgttct gttcgaggtt gctggtggct cggtcggcca tttcgaagag 1680 
ggtgtcgtcg gcgttgtcga acaacgcctg caacgcctgt ttcagctgea gcgccgcctt 1740 
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gtcgcgcacc tgcacgagga tgaceggcag gcgccccgct ccggcgttcg gcgacggctc 1800 

ggaagcgacd ctggtcaggt gcaccacatt ggcatcgttc tgcatcggga ctctcctgcc 1860 

accctcgcgg ccgcgcatgg aatggcaaaa atcgggcaca gaggatcgat tggcgtcgtc 1920 

cgtaacgtca atttccaggc gtcaaaaaca agtatctaca ttcattatag agatactttc 1980 

aaatctagat ag 1992 

<210> 31 
<211> 1830 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 31 

atgccatttc tactggctac caagcacgcg ttgagegccg attcctcggg agatccgttt 60 
tcgctcggtg tattgctcgc caatttctac ggaagcttct ggagtgtttc cgcctatccc 120 
gcgccacagt tactgatctt tgatctttcc ggcagcaccc gcctggcagt gccgtcgatt 180 
ccctccacag cgcagcgtga caggttgagc ggaagctatc cgatgatagt cgagcgcatt 240 
ctggcgcgct tgcgcacccg gccggtgggg gaggacgctc agcgtgtcca ttggatacgc 300 
gctgatcgct atcgcgactc ggcgctggag atgttgggag tcgcccgggt tgatctgccg 360 
gaaacactct ggtggcacga cgagccgaac catctgatca tcgctgcgag cctgcttgat 420 
ctcaggcgaa tcaatgactt cgaacagttg gttgagcgcc cggcattcga ttcgtacagc 480 
ctggtatcgc cggatggcga ggtattgctc ggcgcggccc ctgcgaccgg cctgagggat 540 
ggcctgaacc tcacccgaca gggggtcgcc gttcaactgc gcagccagcc tgagaacggc 600 
tggctcgcgg tctaccgaac cgactacggc aatttctttc gccactcccg gtggctggtg 660 
gcaggtctgc tgctgacccc ggcgctgctc ctggccggt.t ggctcgggat gcgttggtac 720 
accagcagcg tcgtcaaccc ggtgcatcgg gcgcaccggc aactggtgga gagcgacacc 780 
ttcagccgga cgctgataca gaccgcgccg gtggctctgg tggtgctgac ccaggatgac 840 
cagcaactgg tgacctgcaa ccacttggcc gcccagtggc tgggcgggcc cacggagatc 900 
cttgggctga cttccaactg gaagcttttc gatgcgcgtg ggcaggtacc aggagacatc 960 
tgtatccagg tcggtgggcg ctatttgcag accgccttcg cggcgacccg ctatgccggc 1020 
accgaggcgg tactgtgcgt attcaacgac atcacggtcc actgcgaggc ggagaccgcg 1080 
ctgtccaatg cgaagcgagc agcggatgcc gccagccagg ccaagaccct gttcctggcc 114 0 
cgcatgagcc atgaaatccg tactcccctg tacggtgtcc ttggcaccct ggagttgctc 1200 
gacctgacca ccctgaacga gcggcaacgc gcctacctac gcaccatcca gagttcgtct 12 60 
gcgacgctca tgcaactgat tagcgatgtg ctggatgtct cgaagatcga agcggggcag 1320 
atggctctga ccctggccgc cttcaatccg ctggacctag tgcgggaagt gcttggcaac 1380 
tttgccgcca gcgccatggc caaggacctg caggtagacc cgctcgatac tcttgcgctt 144 0 
gaggcgcagg tcgcgcatgg cttcgaagaa agcgttctgt tcgaggttgc tggtggctcg 1500 
gtcggccatt tcgaagaggg tgtcgtcggc gttgtcgaac aacgcctgca acgcctgttt 1560 
cagctgcagc gccgccttgt cgcgcacctg cacgaggatg accggcaggc gccccgctcc 1620 
ggcgttcggc gacggctcgg aagcgaccct ggtcaggtgc accacattgg catcgttctg 1680 
catcgggact ctcctgccac cctcgcggcc gcgcatggaa tggcaaaaat cgggcacaga 174 0 
ggatcgattg gcgtcgtccg taacgtcaat ttccaggcgt caaaaacaag tatctacatt 1800 
cattatagag atactttcaa atctagatag 1830 

<210> 32 
<211> 1608 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 32 

atgatagtcg agcgcattct ggcgcgcttg cgcacccggc cggtggggga ggacgctcag 60 
cgtgtccatt ggatacgcgc tgatcgctat cgcgactcgg cgctggagat gttgggagtc 120 
gcccgggttg atctgccgga aacactctgg tggcacgacg agccgaacca tctgatcatc 180 
gctgcgagcc tgcttgatct caggcgaatc aatgacttcg aacagttggt tgagcgcccg 240 
gcattcgatt cgtacagcct ggtatcgccg gatggcgagg tattgctcgg cgcggcccct 300 
gcgaccggcc tgagggatgg cctgaacctc acccgacagg gggtcgccgt tcaactgcgc 360 
agccagcctg agaacggctg gctcgcggtc taccgaaccg actacggcaa tttctttcgc 420 
cactcccggt ggctggtggc aggtctgctg ctgaccccgg cgctgctcct ggccggttgg 480 
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ctcgggatgc gttggtacac cagcagcgtc gtcaacccgg tgcatcgggc gcaccggcaa 540 

ctggtggaga gcgacacctt cagccggacg ctgatacaga ccgcgccggt ggctctggtg 600 

gtgctgaccc aggatgacca gcaactggtg acctgcaacc acttggccgc ccagtggctg 660 

ggcgggccca cggagatcct tgggctgact tccaactgga agcttttcga tgcgcgtggg 720 

caggtaccag gagacatctg tatccaggtc ggtgggcgct atttgcagac cgccttcgcg 780 

gcgacccgct atgccggcac cgaggcggta ctgtgcgtat tcaacgacat cacggtccac 840 

tgcgaggcgg agaccgcgct gtccaatgcg aagcgagcag cggatgccgc cagccaggcc 900 

aagaccctgt tcctggcccg catgagccat gaaatccgta ctcccctgta cggtgtcctt 960 

ggcaccctgg agttgctcga cctgaccacc ctgaacgagc ggcaacgcgc ctacctacgc 1020 

accatccaga gttcgtctgc gacgctcatg caactgatta gcgatgtgct ggatgtctcg 1080 

aagatcgaag cggggcagat ggctctgacc ctggccgcct tcaatccgct ggacctagtg 1140 

cgggaagtgc ttggcaactt tgccgccagc gccatggcca aggacctgca ggtagacccg 1200 

ctcgatactc ttgcgcttga ggcgcaggtc gcgcatggct tcgaagaaag cgttctgttc 1260 

gaggttgctg gtggctcggt cggccatttc gaagagggtg tcgtcggcgt tgtcgaacaa 1320 

cgcctgcaac gcctgtttca gctgcagcgc cgccttgtcg cgcacctgca cgaggatgac 1380 

cggcaggcgc cccgctccgg cgttcggcga cggctcggaa gcgaccctgg tcaggtgcac 1440 

cacattggca tcgttctgca tcgggactct cctgccaccc tcgcggccgc gcatggaatg 1500 

gcaaaaatcg ggcacagagg atcgattggc gtcgtccgta acgtcaattt ccaggcgtca 1560 

aaaacaagta tctacattca ttatagagat actttcaaat ctagatag 1608 

<210> 33 
<211> 1500 
<212> DNA 

<213> Pseudomonas aeruginosa PA14 
<400> 33 . 

atgttgggag tcgcccgggt 
catctgatca tcgctgcgag 
gttgagcgcc cggcattcga 
ggcgcggccc ctgcgaccgg 
gttcaactgc gcagccagcc 
aatttctttc gccactcccg 
ctggccggtt ggctcgggat 
gcgcaccggc aactggtgga 
gtggctctgg tggtgctgac 
gcccagtggc tgggcgggcc 
gatgcgcgtg ggcaggtacc 
accgccttcg cggcgacccg 
atcacggtcc actgcgaggc 
gccagccagg ccaagaccct 
tacggtgtcc ttggcaccct 
gcctacctac gcaccatcca 
ctggatgtct cgaagatcga 
ctggacctag tgcgggaagt 
caggtagacc cgctcgatac 
agcgttctgt tcgaggttgc 
gttgtcgaac aacgcctgca 
cacgaggatg accggcaggc 
ggtcaggtgc accacattgg 
gcgcatggaa tggcaaaaat 
ttccaggcgt caaaaacaag 



<210> 34 
<211> 1122 
<212> DNA 
<213> Pseudomonas aeruginosa PA14 



tgatchgccg gaaacactct 
cctgcttgat ctcaggcgaa 
ttcgtacagc ctggtatcgc 
cctgagggat ggcctgaacc 
tgagaacggc tggctcgcgg 
gtggctggtg gcaggtctgc 
gcgttggfcac accagcagcg 
gagcgacacc ttcagccgga 
ccaggatgac cagcaactgg 
cacggagatc cttgggctga 
aggagacatc tgtatccagg 
ctatgccggc accgaggcgg 
ggagaccgcg ctgtccaatg 
gttcctggcc cgcatgagcc 
ggagttgctc gacctgacca 
gagttcgtct gcgacgctca 
agcggggcag atggctctga 
gcttggcaac tttgccgcca 
tcttgcgctt gaggcgcagg 
tggtggctcg gtcggccatt 
acgcctgttt cagctgcagc 
gccccgctcc ggcgttcggc 
catcgttctg catcgggact 
cgggcacaga ggatcgattg 
tatctacatt cattatagag 



ggtggcacga cgagccgaac 60 
tcaatgactt cgaacagttg 120 
cggatggcga ggtattgctc 180 
tcacccgaca gggggtcgcc 240 
tctaccgaac cgactacggc 300 
tgctgacccc ggcgctgctc 360 
tcgtcaaccc ggtgcatcgg 420 
cgctgataca gaccgcgccg 480 
tgacctgcaa ccacttggcc 540 
cttccaactg gaagcttttc 600 
tcggtgggcg ctatttgcag 660 
tactgtgcgt attcaacgac 720 
cgaagcgagc agcggatgcc 780 
atgaaatccg tactcccctg 840 
ccctgaacga gcggcaacgc 900 
tgcaactgat tagcgatgtg 960 
ccctggccgc cttcaatccg 1020 
gcgccatggc caaggacctg 108 0 
tcgcgcatgg cttcgaagaa 1140 
tcgaagaggg tgtcgtcggc 1200 
gccgccttgt cgcgcacctg 1260 
gacggctcgg aagcgaccct 1320 
ctcctgccac cctcgcggcc 1380 
gcgtcgtccg taacgtcaat 1440 
atactttcaa atctagatag 1500 
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<400> 34 

atgcgttggt acaccagcag cgtcgtcaac ccggtgcatc gggcgcaccg gcaactggtg 60 
gagagcgaca ccttcagccg gacgctgata cagaccgcgc cggtggctct ggtggtgctg 120 
acccaggatg accagcaact ggtgacctgc aaccacttgg ccgcccagtg gctgggcggg 180 
cccacggaga tccttgggct gacttccaac tggaagcttt tcgatgcgcg tgggcaggta 240 
ccaggagaca tctgtatcca ggtcggtggg cgctatttgc agaccgcctt cgcggcgacc 300 
cgctatgccg gcaccgaggc ggtactgtgc gtattcaacg acatcacggt ccactgcgag 360 
gcggagaccg cgctgtccaa tgcgaagcga gcagcggatg ccgccagcca ggccaagacc 420 
ctgttcctgg cccgcatgag ccatgaaatc cgtactcccc tgtacggtgt ccttggcacc 480 
ctggagttgc tcgacctgac caccctgaac gagcggcaac gcgcctacct acgcaccatc 540 
cagagttcgt ctgcgacgct catgcaactg attagcgatg tgctggatgt ctcgaagatc 600 
gaagcggggc agatggctct gaccctggcc gccttcaatc cgctggacct agtgcgggaa 660 
gtgcttggca actttgccgc cagcgccatg gccaaggacc tgcaggtaga cccgctcgat 720 
actcttgcgc ttgaggcgca ggtcgcgcat ggcttcgaag aaagcgttct gttcgaggtt 780 
gctggtggct cggtcggcca tttcgaagag ggtgtcgtcg ^gcgttgtcga acaacgcctg 840 
caacgcctgt ttcagctgca gcgccgcctt gtcgcgcacc tgcacgagga tgaccggcag 900 
gcgccccgct ccggcgttcg gcgacggctc ggaagcgacc ctggtcaggt gcaccacatt 960 
ggcatcgttc tgcatcggga ctctcctgcc accctcgcgg ccgcgcatgg aatggcaaaa 1020 
atcgggcaca gaggatcgat tggcgtcgtc cgtaacgtca atttccaggc gtcaaaaaca 1080 
agtatctaca ttcattatag agatactttc aaatctagat ag 1122 

<210> 35 
<211> 663 
<212> PRT 

<213> Pseudotnonas aeruginosa PA14 



<400> 35 



Met 


Asp 


Val 


He 


Arg 


Glu 


His 


Glu 


Val 


Phe 


Leu 


Gly Arg He Ala Arg 


1 








5 










10 




15 


Lys 


Ser 


Asp 


Lys 


Thr 


Thr 


Gin 


Lys 


Tyr 


Asp 


Tyr 


Asp Val Val Pro Leu 








20 










25 






30 


Gin 


Arg 


His 


Leu 


Leu 


Ala 


Lys 


Glu 


Asn 


Gly 


Leu 


Ala Val Tyr Glu Gly 






35 










40 








45 


Arg 


Glu 


Phe 


Ser 


Phe 


Ala 


Met 


Pro 


Phe 


Leu 


Leu 


Ala Thr Lys His Ala 




50 










55 










60 


Leu 


Ser 


Ala 


Asp 


Ser 


Ser 


Gly 


Asp 


Pro 


Phe 


Ser 


Leu Gly Val Leu Leu 


65 










70 










75 


80 


Ala 


Asn 


Phe 


Tyr 


Gly 


Ser 


Phe 


Trp 


Ser 


Val 


Ser 


Ala Tyr Pro Ala Pro 










85 










90 




95 


Gin 


Leu 


Leu 


He 


Phe 


Asp 


Leu 


Ser 


Gly 


Ser 


Thr 


Arg Leu Ala Val Pro 








100 










105 






110 


Ser 


He 


Pro 


Ser 


Thr 


Ala 


Gin 


Arg 


Asp 


Arg 


Leu 


Ser Gly Ser Tyr Pro 






115 










120 








125 


Met 


He 


Val 


Glu 


Arg 


He 


Leu 


Ala 


Arg 


Leu 


Arg 


Thr Arg Pro Val Gly 




130 










135 










140 


Glu 


Asp 


Ala 


Gin 


Arg 


Val 


His 


Trp 


He 


Arg 


Ala 


Asp Arg Tyr Arg Asp 


145 










150 










155 


160 


Ser 


Ala 


Leu 


Glu 


Met 


Leu 


Gly 


Val 


Ala 


Arg 


Val 


Asp Leu Pro Glu Thr 










165 










170 




175 


Leu 


Trp 


Trp 


His 


Asp 


Glu 


Pro 


Asn 


His 


Leu 


He 


He Ala Ala Ser Leu 








180 










185 






190 


Leu 


Asp 


Leu 


Arg 


Arg 


He 


Asn 


Asp 


Phe 


Glu 


Gin 


Leu Val Glu Arg Pro 






195 










200 








205 


Ala 


Phe 


Asp 


Ser 


Ty r 


Ser 


Leu 


Val 


Ser 


Pro 


Asp 


Gly Glu Val Leu Leu 




210 










215 










220 


Gly 


Ala 


Ala 


Pro 


Ala 


Thr 


Gly 


Leu 


Arg 


Asp 


Gly 


Leu Asn Leu Thr Arg 


225 










230 










235 


240 


Gin 


Gly 


Val 


Ala 


Val 


Gin 


Leu 


Arg 


Ser 


Gin 


Pro 


Glu Asn Gly Trp Leu 
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TV "I _ 

Ala 


Val 


Tyr 


Arg Thr Asp 


Tyr Gly 








OCA 




T «n 

lieu 


vai 


111 q 

Aia 


Giy lieu Lieu 


Leu Thr 






one 
z /b 






Leu 


Giy 


wee 


Arg i rp i y r 


inr oer 




290 






z yb 


ai a 

Ala 


rilS 


Arg 


m n T.exii Va "1 
VjJ.I1 JjcU Val 


uIU ocl 


*3 f\ C 

3Ub 






J 1U 




bin 


ixir 


ai a 
Aia 


fiO Val Aia 


Lieu vai 








IOC 

32b 




Leu 


vai 


inr 


Cys Asn His 


Lieu Ala 








340 




Glu 


lie 


Leu 


Gly Lieu inr 


Ser Asn 






355 




360 


Gin 


vai 


Pro 


Giy Asp lie 


Cys He 




370 






375 


Thr 


Ala 


pne 


Aia Aia inr 


Arg Tyr 


385 






o o n 
390 




vai 




Asn 


Asp i±e inr 


Va 1 V-? a 
Vai IllS 








/I AC 

4Ub 




Asn 


Ala 


Lys 


Arg Aia Aia 


Asp Aia 








42 0 




Leu 


Ala 


Arg 


Met Ser His 


Pl Tl ^ 

Glu lie 






435 




A A f\ 

44 0 


Gly 


Thr 


Leu 


Glu Leu Leu 


Asp Leu 




450 






455 


Ala 


Tyr 


Leu 


Arg Thr lie 


Gin Ser 


465 






470 




lie 


Ser 


Asp 


Val Leu Asp 


vai ber 








485 




Leu 


Thr 


Leu 


Ala Aia Fne 


Asn Pro 








f— n A 

bOO 




Gly 


Asn 


irne 


Aia Aia oer 


Aia . wee 






C 1 c 

bib 




con 
bz U 


Leu 


Asp 


TVi v 


lieu Ala JjcU 


m 11 ai a 
vjIU Ala. 




530 






535 


Ser 


Val 


Leu 


pne Glu val 


Ala Gly 


545 






550 




Gly 


vai 


vai 


Giy vai vai 


G1U Gin 








565 




Gin 


Arg 


Arg 


Leu Val Ala 


His Leu 








580 






Ser 


Gly 


Val Aircr Atcj 


Arg Leu 






595 




600 


His 


lie 


Gly 


He Val Leu 


His Arg 




610 






615 


Ala 


His 


Gly 


Met Ala Lys 


He Gly 


625 






630 




Arg 


Asn 


Val 


Asn Phe Gin 


Ala Ser 






645 




Arg 


Asp 


Thr 


Phe Lys Ser 


Arg 



660 



zbU 




Z bb 




noli ItHc 


irne Arg ms 


oer Arg 


Trp 


^ob 




z /U 




Dm Ala 
riO Aia 


T,on T.^ii TiAii 

iicu iteu jjcu 


Ala n"l \/ 
Ala biy 


Trp 










oci vai 


Val Aon Prn 
VAX nail tr 


vai iris 


Arg 




3 UU 






nop llll 


DVio Oat* Srn 

xr ne oer Aiy 


inr Leu 


lie 




31b 




"5 o n 
J z V 


vai Lieu 


inr Gin Asp 


Asp Gin 


Pl t-j 

win 






lie 
33b 




Ai sa PI t-i 

Aia Gin 


irp Lieu Giy 


Gly Pro 


inr 


O A C 

34b 




OCA 

3 bO 




Trp Lys 


Leu Phe Asp 


Ala Arg 


pi 
Giy 




3 ob 






H"\ r» 17a 1 

oin vai 


oiy biy Aiy 


iyr iicu 


m n 

Ulii 




t o n 
3 oU 






J\l a r>1 ir 

Aia Giy 


mr oiu Aia 


X7a 1 Ton 

vai lieu 


cys 




OQC 

3 y b 




Ann 


Cys Glu 


T\l -> pi,, rpl- -v- 

Aia giu inr 


Aia Leu 


Ser 


/in 
410 




41b 




Aia ber 


Gin Aia Liys 


inr Jjcu 


fne 


A O C 

42b 




/ion 
43 U 




Arg inr 


fro Lieu iyr 


Giy vai 


Leu 




>i >i c 
44b 






Thr Thr 


Leu Asn Glu 


Arg Gin 


Arg 




460 






Ser Ser 


Ala Thr Leu 


tut *_ ply, 

wee Gin 


Leu 




475 




/ Q ft 


Lys He 


GIU Aia Giy 


Gin wee 


AT a 

Aia 


490 




A QC 




Leu Asp 


Lieu vai Arg 


pl n \Ts3 1 
GIU Val 


Leu 


bOb 




ci n 
blU 




Al a liy S 


Asp lieu lain 


\/a 1 A Qn 

vai nap 


T3t*p 
riu 




rnr 

bzb 






Gin vai 


Ala Ui' q ' nl i; 

Aia ms Giy 


irne Vjiu 


pi n 




540 






Gly Ser 


val Giy his 


pne giu 


GIU 




555 




C A 

boO 


Arg Leu 


Gin Arg Leu 


rile vjin 


Leu 


570 




c o c • 

b /b 




HIS G1U 


Asp Asp Arg 


pi- 7\ 1 -a 

Gin Aia 


Pro 


585 




con 
b90 




Gly Ser 


Asp Pro Gly 


Gin Val 


His 




605 






Asp Ser 


Pro Ala Thr 


Leu Ala 


Ala 




620 






His Arg 


Gly Ser He 


Gly Val 


Val 




635 




640 


Lys Thr 


Ser He Tyr 


He His 


Tyr 


650 




655 





<210> 36 
<211> 609 
<212> PRT 
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<213> Pseudomonas aeruginosa PA14 
<400> 36 

Met Pro Phe Leu Leu Ala Thr Lys HiB Ala Leu Ser Ala Asp Ser Ser 

15 10 15 

Gly Asp Pro Phe Ser Leu Gly Val Leu Leu Ala Asn Phe Tyr Gly Ser 

20 25 30 

Phe Trp Ser Val Ser Ala Tyr Pro Ala Pro Gin Leu Leu He Phe Asp 

35 40 45 

Leu Ser Gly Ser Thr Arg Leu Ala Val Pro Ser He Pro Ser Thr Ala 

50 * 55 60 

Gin Arg Asp Arg Leu Ser Gly Ser Tyr Pro Met lie Val Glu Arg He 
65 ~ 70 75 80 

Leu Ala Arg Leu Arg Thr Arg Pro Val Gly Glu Asp Ala Gin Arg Val 

85 90 95 

His Trp He Arg Ala Asp Arg Tyr Arg Asp Ser Ala Leu Glu Met Leu 

100 105 HO 

Gly Val Ala Arg Val Asp Leu Pro Glu Thr Leu Trp Trp His Asp Glu 

115 120 125 

Pro Asn His Leu He He Ala Ala Ser Leu Leu Asp Leu Arg Arg He 

130 135 140 

Asn Asp Phe Glu Gin Leu Val Glu Arg Pro Ala Phe Asp Ser Tyr Ser 
145 150 155 160 

Leu Val Ser Pro Asp Gly Glu Val Leu Leu Gly Ala Ala Pro Ala Thr 

165 170 175 

Gly Leu Arg Asp Gly Leu Asn Leu Thr Arg Gin Gly Val Ala Val Gin 

180 185 190 

Leu Arg Ser Gin Pro Glu Asn Gly Trp Leu Ala Val Tyr Arg Thr Asp 

195 200 205 

Tyr Gly Asn Phe Phe Arg His Ser Arg Trp Leu Val Ala Gly Leu Leu 

210 215 220 

Leu Thr Pro Ala Leu Leu Leu Ala Gly Trp Leu Gly Met Arg Trp Tyr 
225 230 235 240 

Thr Ser Ser Val Val Asn Pro Val His Arg Ala His Arg Gin Leu Val 

245 250 255 

Glu Ser Asp Thr Phe Ser Arg Thr Leu He Gin Thr Ala Pro Val Ala 

260 265 270 

Leu Val Val Leu Thr Gin Asp Asp Gin Gin Leu Val Thr Cys Asn His 

275 280 285 

Leu Ala Ala Gin Trp Leu Gly Gly Pro Thr Glu He Leu Gly Leu Thr 

290 295 300 

Ser Asn Trp Lys Leu Phe Asp Ala Arg Gly Gin Val Pro Gly Asp He 
305 " 310 315 320 

Cys He Gin Val Gly Gly Arg Tyr Leu Gin Thr Ala Phe Ala Ala Thr 

325 * 330 335 

Arg Tyr Ala Gly Thr Glu Ala Val Leu Cys Val Phe Asn Asp He Thr 

340 345 350 

Val His Cys Glu Ala Glu Thr Ala Leu Ser Asn Ala Lys Arg Ala Ala 

355 360 365 

Asp Ala. Ala Ser Gin Ala Lys Thr Leu Phe Leu Ala. Arg Met Ser His 

370 375 380 

Glu He Arg Thr Pro Leu Tyr Gly Val Leu Gly Thr Leu Glu Leu Leu 
385 ~ 390 395 400 

Asp Leu Thr Thr Leu Asn Glu Arg Gin Arg Ala Tyr Leu Arg Thr He 

405 410 415 

Gin Ser Ser Ser Ala Thr Leu Met Gin Leu He Ser Asp Val Leu Asp 

420 425 430 

Val Ser Lys He Glu Ala Gly Gin Met Ala Leu Thr Leu Ala Ala Phe 
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435 




440 




445 








Asn 


Pro 


Leu Asp 


Leu Val 


Arg Glu 


Val 


Leu Gly Asn Phe 


Ala 


Ala 


Ser 




450 






455 




460 








Ala 


Met 


Ala Lys 


Asp Leu 


Gin Val 


Asp 


Pro Leu Asp Thr 


Leu 


Ala 


Leu 


465 






470 






475 






480 


Glu 


Ala 


Gin Val 


Ala His 


Gly Phe 


Glu 


Glu Ser Val Leu 


Phe 


Glu 


Val 








485 






490 




495 




Ala 


Gly 


Gly Ser 


Val Gly 


His Phe 


Glu 


Glu Gly Val Val 


Gly 


Val 


Val 






500 






505 




510 






Glu 


Gin 


Arg Leu 


Gin Arg 


Leu Phe 


Gin 


Leu Gin Arg Arg 


Leu 


Val 


Ala 






515 




520 




525 








His 


Leu 


His Glu 


Asp Asp 


Arg Gin 


Ala 


Pro Arg Ser Gly 


Val 


Arg 


Arg 




530 






535 




540 








Arg 


Leu 


Gly Ser 


Asp Pro 


Gly Gin 


Val 


His His lie Gly 


lie 


Val 


Leu 


545 






550 






555 






560 


His 


Arg 


Asp Ser 


Pro Ala 


Thr Leu 


Ala 


Ala Ala His Gly 


Met 


Ala 


Lys 








565 






570 




575 




lie 


Gly 


His Arg 


Gly Ser 


lie Gly 


Val 


Val Arg Asn Val 


Asn 


Phe 


Gin 






580 






585 




590 






Ala 


Ser 


Lys Thr 


Ser lie 


Tyr lie 


His 


Tyr Arg Asp Thr 


Phe 


Lys 


Ser 






595 




600 




605 









Arg 



<210> 37 
<211> 535 
<212> PRT 

<213> Pseudomonas aeruginosa PA14 



<400> 37 



Met 


He 


Val 


Glu 


Arg 


He 


Leu 


Ala 


Arg 


Leu Arg Thr Arg 


Pro 


Val 


Gly 


1 








5 










10 




15 




Glu 


Asp 


Ala 


Gin 


Arg 


Val 


His 


Trp 


He 


Arg Ala Asp Arg 


Tyr 


Arg 


Asp 








20 










25 




30 






Ser 


Ala 


Leu 


Glu 


Met 


Leu 


Gly 


val 


Ala 


Arg Val Asp Leu 


Pro 


Glu 


Thr 






35 










40 




45 








Leu 


Trp 


Trp 


His 


Asp 


Glu 


Pro 


Asn 


His 


Leu He He Ala 


Ala 


Ser 


Leu 




50 










55 






60 








Leu 


Asp 


Leu 


Arg 


Arg 


He 


Asn 


Asp 


Phe 


Glu Gin Leu Val 


Glu 


Arg 


Pro 


65 










70 








75 






80 


Ala 


Phe 


Asp 


Ser 


Tyr 


Ser 


Leu 


Val 


Ser 


Pro Asp Gly Glu 


Val 


Leu 


Leu 










85 










90 




95 




Gly 


Ala 


Ala 


Pro 


Ala 


Thr 


Gly 


Leu 


Arg 


Asp Gly Leu Asn 


Leu 


Thr 


Arg 








100 










105 




110 






Gin 


Gly 


Val 


Ala 


Val 


Gin 


Leu 


Arg 


Ser 


Gin Pro Glu Asn 


Gly 


Trp 


Leu 






115 










120 




125 








Ala 


Val 


Tyr 


Arg 


Thr 


Asp 


Tyr 


Gly 


Asn 


Phe Phe Arg His 


Ser 


Arg 


Trp 




130 










135 






140 








Leu 


Val 


Ala 


Gly 


Leu 


Leu 


Leu 


Thr 


Pro 


Ala Leu Leu Leu 


Ala 


Gly 


Trp 


145 










150 








155 






160 


Leu 


Gly 


Met 


Arg 


Trp 


.Tyr 


■Thr 


Ser 


Ser 


Val Val Asn Pro 


Val 


His 


Arg 










165 










170 




175 




Ala 


His 


Arg 


Gin 


Leu 


Val 


Glu 


Ser 


Asp 


Thr Phe Ser Arg 


Thr 


Leu 


He 








180 










185 




190 






Gin 


Thr 


Ala 


Pro 


Val 


Ala 


Leu 


Val 


Val 


Leu Thr Gin Asp 


Asp 


Gin 


Gin 






195 










200 




205 








Leu 


Val 


Thr 


Cys 


Asn 


His 


Leu 


Ala 


Ala 


Gin Trp Leu Gly 


Gly 


Pro 


Thr 
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210 215 .220 

Glu lie Leu Gly Leu Thr Ser Asn Trp Lys Leu Phe Asp Ala Arg Gly 
225 230 235 240 

Gin Val Pro Gly Asp He Cys He Gin Val Gly Gly Arg Tyr Leu Gin 

245 250 255 

Thr Ala Phe Ala Ala Thr Arg Tyr Ala Gly Thr Glu Ala Val Leu Cys 

260 265 270 

Val Phe Asn Asp He Thr Val His Cys Glu Ala Glu Thr Ala Leu Ser 

275 280 285 

Asn Ala Lys Arg Ala Ala Asp Ala Ala Ser Gin Ala Lys Thr Leu Phe 

290 295 300 

Leu Ala Arg Met Ser His Glu He Arg Thr Pro Leu Tyr Gly Val Leu 
305 " 310 315 320 

Gly Thr Leu Glu Leu Leu Asp Leu Thr Thr Leu Asn Glu Arg Gin Arg 

325 330 335 

Ala Tyr Leu Arg Thr He Gin Ser Ser Ser Ala Thr Leu Met Gin Leu 

340 345 350 

He Ser Asp Val Leu Asp Val Ser Lys He Glu Ala Gly Gin Met Ala 

355 360 365 

Leu Thr Leu Ala Ala Phe Asn Pro Leu Asp Leu Val Arg Glu Val Leu 

370 375 380 

Gly Asn Phe Ala Ala Ser Ala Met Ala Lys Asp Leu Gin Val Asp Pro 
385 390 395 400 

Leu Asp Thr Leu Ala Leu Glu Ala Gin Val Ala His Gly Phe Glu Glu 

405 410 415 

Ser Val Leu Phe Glu Val Ala Gly Gly Ser Val Gly His Phe Glu Glu 

420 425 430 

Gly Val Val Gly Val Val Glu Gin Arg Leu Gin Arg Leu Phe Gin Leu 

435 440 445 

Gin Arg Arg Leu Val Ala His Leu His Glu Asp Asp Arg Gin Ala Pro 

450 455 460 

Arg Ser Gly Val Arg Arg Arg Leu Gly Ser Asp Pro Gly Gin Val His 
465 470 ~ 475 480 

His He Gly He Val Leu His Arg Asp Ser Pro Ala Thr Leu Ala Ala 

485 490 495 

Ala His Gly Met Ala Lys He Gly His Arg Gly Ser He Gly Val Val 

500 505 510 

Arg Asn Val Asn Phe Gin Ala Ser Lys Thr Ser He Tyr He His Tyr 

515 520 525 

Arg Asp Thr Phe Lys Ser Arg 
530 535 



<210> 38 • 
<211> 499 
<212> PRT 

<213> Pseudomonas aeruginosa PA14 
<400> 38 

Met Leu Gly Val Ala Arg Val Asp Leu Pro Glu Thr Leu Trp Trp His 

1 5 10 15 

Asp Glu Pro Asn His Leu He He Ala Ala Ser Leu Leu Asp Leu Arg 

20 25 30 

Arg He Asn Asp Phe Glu Gin Leu Val Glu Arg Pro Ala Phe Asp Ser 

35 40 45 

Tyr Ser Leu Val Ser Pro Asp Gly Glu Val Leu Leu Gly Ala Ala Pro 

50 55 60 

Ala Thr Gly Leu Arg Asp Gly Leu Asn Leu Thr Arg Gin Gly Val Ala 



-45- 



WO 03/004689 



PCT7US02/21431 



65 






/ u 


vai 


V3J.11 


Jjeu ATy oer 


win rro vjiu 






Q C 






Asp 


ryr i^iy Asn 


jrne rne -nxy 






t a a 
10U 




Leu 


Leu 


Lieu inr Fro 


Ala JjeU JJCU 






115 


ion 
120 


Trp 


Tyr 


Thr Ser Ser 


Tf-1 tt»1 7\ 01-1 

vai vai Asn 




130 




135 


Leu 


vai 


giu faer Asp 


inr pne oer 


145 






150 


VS. J. 




Lieu vai Vol 


lieu inr Liiii 






165 




Asn 


His 


Leu Ala Ala 


Gin Trp Leu 






180 




Leu 


lnr 


Ser Asn Trp 


Lys Leu Pne 






toe 
195 


2 U U 


Asp 


lie 


Cys lie Gin 


vai Giy tjiy 




210 




one 
215 


Ala 


Thr 


Arg Tyr Ala 


Gly Thr Glu 


225 






230 


lie 


Thr 


Val His Cys 


Glu Ala Glu 






245 




Ala 


Ala 


Asp Ala Ala 


Ser Gin Ala 






260 




Ser 


His 


Glu lie Arg 


Thr Pro Leu 






275 


280 


Leu 


Leu 


Asp Leu Thr 


Thr Leu Asn 




290 




295 


Thr 


lie 


Gin Ser Ser 


Ser Ala Thr 


305 






310 


Leu 


ASp 


Val Ser Lys 


lie giu Aia 






325 




Ala 


Phe 


Asn Pro Leu 


Asp Leu Val 






340 




Ala 


Ser 


Ala Met Ala 


Lys Asp Leu 






355 


360 


Ala 


Lieu 


G1U Ala Vain 


vai Aia nis 




370 




375 


Glu 


Val 


Ala Gly Gly 


ber vai Gly 


385 






390 


vai 


vai 


Glu Gin Arg 


Lieu CrLn Arg 






405 




Val 


Ala 


His Leu His 


Glu Asp Asp 






420 




Arg 


Arg 


rix y jjcu uiy 


.Q^r* A cn Pro 






435 


440 


Val 


Leu 


His Arg Asp 


Ser Pro Ala 




450 




455 


Ala 


Lys 


lie Gly His 


Arg Gly Ser 


465 






470 


Phe 


Gin 


Ala Ser Lys 


Thr Ser lie 






485 




Lys 


Ser 


Arg 







n c 
/5 




O A 


Asn 


vjiy irp lieu Aia vai 


Tyr 


Arg 




q a 

y 0 






n j. 0 


Car ZiYYT r T*T"T"\ T ,01 1 Vfl 1 

ocj, iuy i. -L ucu vai 


AT = 
nia 


Gl v 
uiy 


1 a c 
105 


110 






Leu 


Aia biy ny lie u. oiy 


Mot- 

1*1 eu 


Arg 




IOC 

12b 






Pro 


vai nis Arg Aia nis 


Arg 


bin 




140 






Arg 


inr ijeu lie tain inr 


Til a 

Aia 


Pro 




155 




"1 C A 

lbO 


Asp 


Asp bin vjin jjeu vai 


inr 


Cys 




170 


1/5 




Gly 


Gly pro inr vjriu lie 


Leu 


oiy 


185 


iy u 






Asp 


Al =» A>-n Pi v Pin T7a 1 

Aia Arg vjiy uin vai 


Pro 


uiy 




O A C 

205 






Arg 


iyr lieu uin inr Aid 


rrllC 


nla 




O O A 

22 0 






Aia 


vai lieu cys vai rrne 


Asn 


Asp 




235 




240 


Thr 


Ala Leu Ser Asn Ala 


Lys 


Arg 




250 


255 




Lys 


Thr Leu Phe Leu Ala 


Arg 


Met 


265 


270 






Tyr 


Gly val Leu Gly Tixr 


Leu 


GIU 




285 






Glu 


Arg Gin Arg Ala Tyr 


Leu 


Arg 




300 






Leu 


Met Gin Leu lie ber 


Asp 


vai 




315 




^ O A 

32 0 


Gly 


Gin Met Ala Leu Thr 


Leu 


Ala 




330 


335 




Arg 


C3LU vai Leu oiy Asn 


pne 


Aia 


345 


350 


inr 




Gln 


Val Asp Pro Leu Asp 


Leu 




365 






Gly 


Pne CjIu glu ber vai 


Leu 


pne 


380 






His 


Pne giu giu Giy vai 


vaL 


Giy 




395 




>1 A A 

400 


Leu 


Phe Gin Leu Gin Arg 


Arg 


Leu 




410 


415 




Arg 


Gin Aia pro Arg oer 




vai 


425 


430 






Gly 


Gin Val His His lie 


Gly 


lie 




445 






Thr 


Leu Ala Ala Ala His 


Gly 


Met 




460 






lie 


Gly Val Val Arg Asn 


Val 


Asn 




475 




480 


Tyr 


lie His Tyr Arg Asp 


Thr 


Phe 




490 


495 





<210> 39 
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<211> 373 
<212> PRT 

<213> Pseudomonas aeruginosa PA14 



<400> 39 



M«at- 


Arg 


Trn 
lip 


Tvr 
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Val 
v dl 
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Vdl Aoii riu Vdl 
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1 5 

XD 




Arg 


m n 
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Val 


Glu 
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PVio Qov Avp HPVi v 
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T.Qll XT ^ Pi y\ 

xjcu xxe bin 


1 ill 








20 










95 
^ o 






Ala 


IT 1 (J 


Val 


Ala 


TiPT 7 
JJCU 


Val 
val 


Val 
v dl 


T 

Leu 


XllX ulll nap AsLy 


Oil n /Tl 1 T,«ii 
Lor 111 blu XlCU 


Val 
VdX 






35 










40 




45 




THr 

XllX 




Asn 
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Al a 


Al a 
Aid 
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T'vr^ Tidii (til u 0.1 xr 
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lie 
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GO 

O L/ 
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XjGU, 


oi y 


Leu 


j. hi. 
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OCX 


Asn 


Trp 
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T.aii Pino A en 211 a 
UcU flic nop Aid 


7i V*/^f Pi 1 if P. T Tl 

Axy bxy bin 


Val 
Vdl 


65 










70 
/ l/ 
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AO 
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Asp 
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^»y o 
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X 1C 


ft! Tl 
L7111 


Val 
vdl 


vjiy vaiy Aiy xyx 


T.OII P.1 Tl T , Vl>" 

xicu bxn 1 in 


Al a 
Aid 










85 








QO 

LI 






tr 11C 


ai a 


Ala 


Thr 

lul 


Arg 


xyr 


Al a 
Aid 


uiy 


TVir f21 ii Ala Val 
XllX OlU Aid vdl 


T.qii pi/a T7a 1 

XjcU L.yS Val 


Jrllc 








100 










1 0 R 

1U J 


xxu 




Ash 


ASp 


Tl R 

11C 


Thr 

X 1IX 


Va 1 
val 


111 a 


cys 


ulll 


Ala dl ii TVir* Ala 
Aid bill lllx AXd 


Xicu oer ash 


Ala 
Aid 






115 










120 




19 5 
1^0 






Arg 


Al a 


AT a 
Aid 


ASp 


Al a 
Ala 


Al a 
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Ser 
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TVl a 
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i n 










1 "5 5 
1J3 




X*±U 






Arg 


FITS L- 
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ul LL 
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val JjcU bXy 
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1 50 

13 LI 






1 c;5 
133 




i fin 
10 u 
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m n 

ul U 
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A art 

AO p 


Leu 


1 111 


X 111 


T.oii Bom P.1 ii "A v*fr 
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PI T\ 7i ■V~fT A 1 n 

bxn Arg Ala 


Tyr 










165 








1 70 
1 lU 
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1/3 
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